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20  .^Abstract 

Parzen  (1979)  suggests  a  location  and  scale  model  for  the  quantile 
function  (Inverse  distribution  function)  of  a  random  variable.  We 
extend  this  model  to  the  two  sample  and  k-sample  problems  and  some 
results  are  given  which,  when  fully  implemented,  will  yield  more 
general  solutions  in  the  analysis  of  variance.  Most  of  the  work  here 
concerns  the  location  and  scale  model  suggested  by  Parzen  (1980)  for 
the  two  sample  problem  for  testing  the  equality  of  two  distribution 
functions  versus  local  alternatives. 

We  implement  this  model  (its  tests  and  estimators)  for  seven  under¬ 
lying  densities.  We  then  provide  criteria  for  choosing  or  determining 
whether  an  underlying  density  models  the  differences  of  the  two  samples 
adequately.  These  criteria  allow  one  to  choose  the  best  of  several 
underlying  densities  for  the  data.  We  illustrate  these  techniques  by 
analyzing  data  sets  from  the  literature  and  making  comparisons  with 
other  authors'  techniques.  We  also  show  how  the  Parzen  (1980)  model 
is  related  to  many  of  the  techniques  developed  for  studying  differences 
of  two  samples  over  the  past  50  years.  We  suggest  extensions  of 
Parzen's  model.  Finally,  we  give  a  few  simulated  examples  and  suggest 
what  type  of  simulation  study  is  needed  to  further  define  the  usefulness 
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1.  INTRODUCTION 


1.1  The  Problem 

A  fundamental  problem  of  statistical  theory  and  application  is  the 
two  sample  problem,  i.e.,  comparing  two  populations  given  random  sam¬ 
ples  from  each.  For  example,  researchers  are  often  interested  in  in¬ 
ferring  the  effect  of  a  treatment  on  a  response  variable  for  some 
general  population.  The  inference  is  based  on  the  observed  responses 
from  a  control  group  and  a  treatment  group  selected  from  the  popu¬ 
lation  being  studied.  The  various  methods  of  dealing  with  these  two 
data  sets  have  been  generalized  to  k  groups  and  have  also  been  used 
in  developing  general  statistical  theory.  The  two  sample  problem  has 
indeed  been  a  cornerstone  of  statistical  science.  We  begin  by  giving 
a  series  of  definitions  basic  to  the  approach  we  shall  use.  In  these 
definitions  we  follow  Parzen  (1961,  1967,  1979,. 1980). 

1.1.1  Definitions  and  Notations 

We  have  independent  realizations  {X,,  X0,  ...,  X  }  and 

14  m 

<Yr  Yj,  ....  Y^)  of  continuous  random  variables  X  and  Y  having  con¬ 
tinuous  increasing  distribution  functions  F(x)  and  G(x)  respectively. 
The  distribution  functions  F(x)  and  G(x)  often  represent  those  of  the 
control  and  treatment  groups  respectively.  A  popular  model  for  X^ 
and  Y j ,  which  is  assumed  in  this  work,  is  that  both  distributions  are 
a  location  and  scale  change  from  a  common  distribution  function, 

Fq(x),  i.e.,  for  -•  <  x  <  •  , 

This  dissertation  will  follow  the  style  of  the  Journal  of  the 
American  Statistical  Association. 
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and  the  population  comparison  distribution  function  corresponding  to 
D(u)  by 

D(u)  =  F[H_1(u) ]  ,  0<u<l  .  (1.6) 

Alternative  definitions  for  a  comparison  function  are  FG  \  GF  \  G  ^F 
and  F  ^G.  Doksum  and  Sievers  (1976)  and  others  have  studied  some  of 
these  alternative  comparison  functions. 

Switzer  (1976),  Doksum  and  Sievers  (1976),  Wilk  and  Gnanades- 
ikan  (1968),  Doksum  (1974)  and  Steele,  Zimmer,  and  Williams  (1974) 
have  also  studied  comparison  functions.  However,  here  D(u)“F[H  ^(u)] 
is  preferred  because  it  tends  to  have  more  jump  points  than  any  of 
the  other  forms.  It  is,  in  a  sense,  "smoother"  than  any  of  the 
others.  Furthermore,  {R^;  If.  i  £  N)  the  set  of  relative  ranks 
of  the  X  sample  are  given  by 

^  =  m  F(H~L  (|)] 

as  noted  in  Pyke  and  Shorack  (1968)  .  In  sections  4. 1-4. 3  we  will 
provide  some  data  analytic  comparisons  of  our  approach  with  those 
using  alternative  comparison  functions.  These  comparisons  will  also 
emphasize  the  theoretical  differences. 

In  modelling  D(u)  and  Q(u)  we  require  the  density-quantile 
function  defined  by 

fQ(u)  -  f tQ(u) ]  -  F'CQ(u)]  ,  (1.7) 

and  the  score  function  defined  by 

J(u)  =  -  (fQ)'(u)  .  (1.8) 


» 


to  exist  for  all  results  presented  in  this  work. 


We  further  define  some  models  for  comparing  the  two  samples. 
Under  Hq:  F=G  and  alternatives  close  to  the  null  hypothesis  in 
location  and  scale  we  use  Parzen's  (1980)  model  for  D(u)  defined 
by 

D(u)-u=(l-X){0f  Q  (u)+i|>Q  (u)f  Q  (u) }  ,  (1.9) 

o  o  o  o  o 

lJ2~Xii 

where  9  =  -  and  =  — 1 -  .  We  also  compare  the  two  samples  hy 

°1  °1 

a  model  of  A^(u)  =  Qy(u)  -  O^Cu)  ,  the  difference  of  the  quantile 
functions.  This  model  will  be  suggested  by  Theorem  1.1  as 

foQo(u)AQCu)  =  (U2_Ul)  foQo(u)+(Vai)Qo(u)foQo(u)  *  (1‘10) 

which  is  valid  under  all  location  and  scale  alternatives  to  fl  :F=G. 

o 

We  denote  estimators  of  D(u)and  A^(u) as  D(u)and  A^(u) ,  respectively. 

Estimators  of  D(u)  and  A^(u)  can  be  obtained  using  the  results 
of  continuous  parameter  time  series  regression  estimation  developed 
by  Parzen  (1961,  1967).  A  detailed  discussion  of  these  estimators 
is  given  in  sections  2  and  6  and  is  essentially  taken  from 
Parzen  (1979),  section  9  and  10,  and  Parzen  (1980). 

We  define  a  Brownian  bridge  or  a  tied  down  Weiner  process 
to  be  a  normal  process  denoted  by 

(B(u)  ,  0  _<  u  1}  ,  (1.11) 

which  has  zero  mean  and  covariance  kernel 

KB(u1,u2)  =  min  (u1,u2)  -  u^ 


(1.12) 


Finally,  we  define  the  reproducing  kernel  Hilbert  space  (RKHS)  of  B(u) 
to  be  the  space  of  L 2  differentiable  functions  with  inner  product 

<f,g>p  q~/pq  f'(u)g'(u)du  +  ^f(p)g(p)  +  ^^f(q)g(q).  (1*13) 

Throughout  this  work  we  denote  weak  convergence  of  a  process 
by  and  convergence  in  distribution  by  "S'*. 

1.1.2  Questions  to  be  Addressed 

One  desires  to  infer  from  the  samples  how  the  populations  for 
the  two  samples  differ.  The  distribution  functions  F(x)  and  G(x) 
each  have,  in  general,  an  infinite  number  of  parameters  and  it  is 
our  task  to  summarize  or  characterize  the  differences  in  these 
distribution  functions.  The  quantile  function  has  advantages  in 
this  regard  as  remarked  in  Parzen  (1979)  and  Wilk  and  Gnanadesikan 
(1968)  . 

One  explanation  of  its  statistical  virtues  is  the  fact  that 
Q(u)  -  X(j)  ,  , 

where  is  the  jth  order  statistic.  The  order  statistics  are  the 

most  universal  set  of  sufficient  statistics  since  all  sufficient 
statistics  are  a  function  of  the  order  statistics. 

The  problems  we  address  are  illustrated  by  the  t-test.  If 
one  assumes  the  data  are  normally  distributed  with  o^  =  >  one 

obtains  an  exact  solution  (t-test)  to  a  well-posed  problem.  Of 
course, these  assumptions  are  usually  only  approximately  true. 

Thus,  the  t  distribution,  which  gives  both  a  test  of  Hq:  ■  Uj 

and  a  confidence  interval  for  ~  ^2  ’  Prov^-^es  exact  solutions 


6‘ 

to  approximate  problems.  If  f  ,  we  have  the  Behrens-Fisher 
problem,  which  is  usually  more  realistic  and  currently  has  no  exact 
solution. 

The  nonparametric  problem  considered  in  this  dissertation 

assumes  that  the  data  are  not  known  to  be  normally  distributed. 

One  problem  is  then  to  choose  from  a  collection  of  Fq  functions 

those  which  best  fit  the  data.  Another  problem  is  to  develop 

techniques  to  estimate  and  test  hypotheses  concerning  the  parameters 
U2_wl  °2_°1 

9  ■  — -  ,  ip  -  — - -  ,  u2  ~  W1  ’  anc*  a2  ”  °i  •  These  techniques 

then  provide  tests  of  Hq:  F-G  and  an  estimator,  D(u)  ,  of  D(u)  . 

Through  D(u)  -  D(u)  we  provide  techniques  to  determine: 

(1)  whether  the  two  samples  differ  in  location  and  scale 
parameters  for  a  given  fQ,  and 

(2)  whether  the  assumed  density,  f  ,  can  model  the  data 
well. 

We  implement  and  expand  some  of  Parzen's  (1979,  1980)  results. 

In  cases  where  several  Fq  may  model  the  data,  one  can  compare  the 
various  estimates  of  9,  p,  ~  *  and  °2  ~  °1  ’  w^'en  suanti" 

tative  or  oualitative  differences  exist  among  the  various  F^  we 
will  suggest  larger  samples  for  more  power,  or  subject  matter  based 
selection  of  Fq  rather  than  statistically  based  selection. 

1.2  The  Solution 

We  introduce  the  solution  in  this  section  and  give  the  detailed 
implementation  of  the  solution  in  sections  2.1  through  2.5 and  6.2. 
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The  aim  of  the  approach  implemented  in  this  work  is  to  simul¬ 
taneously  estimate  location  and  scale  differences  between  two 
populations.  Our  approach  emphasizes  estimators  of  location  and 
scale  differences  that  are  asymptotically  optimal  for  the  same 

underlying  f  .  The  approach  may  also  provide  diagnostics  for 
o 

skewness,  long  tails,  bimodality,  and  estimates  for  nonconstant 
shifts  in  the  various  quantiles  of  a  population  using  techniques 
from  Parzen  (1979).  The  approach  begins  with  D(u)  *  F[H  ^(u)] 
and  its  raw  estimator  D(u)  »  F(H  ^(u)]. 

Since  F(x)  *  G(x)  iff  H-1(u)  -  F_1(u)  -  G-1(u)  iff  D(u)  - 
F[H  \u)]  *  u,  the  comparison  function  D(u)  can  be  used  to  test 
Hq:  F(x)=G(x)  by  testing  Hq:  D(u)*u.  The  asymptotic  distribution 
of  D(u)  under  H^:  F**G,  is  given  by 

T  1-A  ! 

"ft  [D(u)-u]  -*■  (  .  °)  i  B (u)  ■=  cB (u) 

A 

O 

where  X  X  ,0<X  <1.  A  proof  of  this  fact  is  outlined  in 

No  o  r 

Parzen  (1980),  and  essentially  given  in  Pyke  and  Shorack  (1968). 
Parzen's  (1980)  representation 

vft[D(u)-u]«(l-X)[ef  Q  (u)-HJ/Q  (u)f  Q  (u)  ]  +  cB(u) 

0  0  o  o  o 

is  adopted  here.  In  essence,  this  is  the  result  of  a  linear  Taylor 
series  expansion  for  D(u)  which  we  discuss  in  section  2  in 
greater  detail. 

As  a  result,  with  Parzen's  (1961,  1967)  results,  we  simultan¬ 
eously  estimate  6  and  < l>  from  a  commonly  assumed  fQ  as 


« 


**  \ 


\ 
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m  r<f  Q  ,f  Q  >  <f  Q  ,Q  (f  Q  )>  1 

. .  9  I  oxo’  o  o  cro'^o  oxo  j 

2  "  <f-.Qn»Q_^fr.Q_)><Q_(f  Q_) *Q_Cf_Q_)>  J 

LipJ  o  o  o  o  o  ooo  ooo  -* 

.  r  <fQv  D(u)  ■  u>  1 

L  <Q-(f  Qa) •  D<u>  -  u>  J 


-1 


where 


<fi'f2>p,<i  ■  /;  *;«•>  »;<»>  *> + 

and 

<fl,f2>  "  llm  <fl»f2> 

p-*0  p,q 

q-»l 


Estimators  obtained  when  one  uses  <f^,f2>  ,  0  <  p  <  q  <  1  , 

p.q 

are  briefly  mentioned  in  section  2.4  where  we  obtain  trimmed  or 

truncated  estimators  of  9  and  ip  for  the  exponential  density.  Also, 

note  that  the  inner  product  <f ^ , ^2>  cxist9  f°r  ®*ny  ®ore 

p.q 

density  functions  than  does  s*nce  fhe  latter  requires  that, 

for  j-1,2  .  f j  (p)  ■+  0  as  p  -*■  0,  1.  The  computational  formulas  for 
6  and  ip  are  surprisingly  simple  for  many  densities.  The 
similarities  of  tests  based  on  6  and  ^  to  other  tests  will  be 

A  A 

established  in  sections  5.1  through  5.4.  Table  1  elves  ft  and  p 
for  several  fQ  densities.  Before  proceeding  to  the  derivation  of 
these  estimators  we  will  consider  a  model  for  the  quantile  functions. 


V* 


11 


The  quantile  functions  and  Qy  can  also  be  plotted  to 
compare  the  two  samples.  These  plots  and  their  box  plots,  as  de¬ 
fined  in  Parzen  (1979),  give  initial  indications  of  skewness,  bi¬ 
modality,  and  differences  in  location  and  scale.  A  model  for  the 
differences  at  each  quantile  is  given  in  the  following  theorem. 


Theorem  1.1;  If  (X^  ;  1  ■  1,  _ _  m}  is  a  random  sample  from 

x-u 

F(x)  *  F  ( - )  and  {Y  ;  j  ■  1,  ....  n}  is  an  independent  random 

o  (7^  j 

x-u2 

sample  from  G(x)  »  F  ( - ),  where  f '  exists,  f  >0  is  continuous  and  tail 

o  o  2  °  0 

monotone  [see  Parzen  (1979),  p.  116],  then,  as  N  +  »,  such  that 

XN“^Xo  (°<>0<1). 

^  f0Q0(u)[QY(u)-Qx(u)-(p2-p1)-(o2-o1)Qo(u)]  i  c  B(u) , 


2  2  2 

where  c  ■  X  a,  +  (1-X  )  a,  and  B(u)  is  a  Brownian  bridge. 

O  1  O  L 


Proof:  From  Parzen  (1979),  since  f  is  tail  monotone,  we  have 
-  o 

^j(foQo(u))(QY(u)-u2-o2Qo(u>)  t(l-lo)1‘o2  B2(u)  , 

as  N  +  «,  and 

-^(f  0  (u))(QY(u)-u.-o.Q  (u))  i  -X  * V  B  (u)  , 

00  X  llo  Oil 

as  N  +  «  where  B^(u)  and  B2(u)  are  independent  Brownian  bridges. 

Thus,  the  independence  of  Q  and  Q  yields 

A  I 

^(foQo(u))[(QY(u)-Qx(u)-(u2-u1)-(o2-o1)Qo(u)]  *  Z(u)  , 


*r 


\ 
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where 

Z(u)  -  (l-X^*5  o2  B2(u)  -  X B^u)  . 

Now,  E[Z(u)]  =  0  ,  since  B^(u)  and  B2 (u)  are  zero  mean  normal 
processes.  For  0  _<  _<  1  and  0  <_  u2  <_  1,  we  have 

cov  [Z(Ul),  Z(u2)]=  cov  [  (l-Ao)!?o2B2(u1)-Xoilo1B1(u1)  , 

<1-)o)SB2<'j2)-,o\B1<u2>1 

■  EI(Ho)o^B2<u1)B2(a2)-l0'’o1(l-X0)>‘ 
•o2B2(u1)Bl(u2)-2o'5a1(l-Ao)lio2B1(u1) 
•  B2(u2)+X0ct^B1(u1)B1(u2)] 

-  E[(l-Xo)o2B2(u1)ff2(u2)]+ 

E[X(jo^B1(u1)B1(u2)] 

-  (1-Xq)o2  cov[B2(u1),B2(u2>]  + 

Xo°l  cov[B1(u1),B1(u2)] 

2  2 

*  ^oal+*1'*o*°2^mln*ul,u2*~ulu2^  • 

Since  linear  combinations  of  independent  Gaussian  process  are 
Gaussian,  we  have 


I 


I 


V 
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Z(u)  -  [X  o?  +  (1-X  )o^]!iB(u) 
o  I  o  i. 


We  thus  have  a  model  for  A^  *  .  We  also  emphasize  that  A^ 

seems  to  be  a  very  interesting  and  interpretable  function  since  It 
quantifies  the  differences  between  X  and  Y  at  every  quantile, 
ue(0,l).  We  will  then  be  able  to  obtain  diagnostics  for  how  the 
different  quantiles  of  the  populations  are  changed  by  the  treat¬ 
ment.  Further,  some  commonly  ised  diagnostics  which  are 

functionals  of  A  *  Q  -  c are  -  X,  ^  A.(u)du  and  the 
W  Y  i.  i.  1  O 

difference  in  medians  A^f-^).  Furthermore,  if  A^=  k,  a  constant, 
then  0^=0 2  and  (k  =*  0  implies 


This  approach  will  thus  provide: 

(1)  tests  of  u.  -  Uo  for  several  f  , 

i  L  O 

(2)  tests  of  a,  •  a_  for  several  f  , 

1  i  o 

(3)  simultaneous  tests  for  (1)  and  (2)  for  several  common  f 

o 

for  each  sample, 

(4)  estimators  for  the  parameters  of  (1),  (2),  and  (3), 

(5)  models  for  estimating  the  difference  at  all  quantiles, 
Aq(u)  , 

(6)  graphical  comparisons  of  the  two  samples  , 

(7)  a  basis  for  theory  on  similar  results  for  skewed  and 
bimodal  data  , 

(8)  a  basis  for  theory  on  similar  results  using  trimmed 
estimators,  i.e.,  inner  products  with  0<p<u<q<l. 


(u) 
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1.3  Contributions  of  this  Research 


We  implement  and  extend  the  techniques  of  Parzen  (1979,  1980) 


(1)  giving  calculation  formulas  for  0  and  1 p  for  seven 
underlying  densities; 

(2)  using  the  relation  of  9  and  i|>  to  other  linear  rank 
test  statistics  to  provide  finite  sample  size  tests 
and  parameter  estimates  based  on  0  and  ip  ; 

(3)  providing  calculation  formulas  for  the  two  parameter 
exponential  distribution  for  truncated  estimates  of 

0  and  ip  in  the  two  sample  problem; 

(4)  proving  the  asymptotic  normality  of  D(u)  *  [D(u^) , 

~  a  f 

D(u2),  D(iik)]  for  fixed  (u^  1=1,  ...,  k}; 

(5)  proving  the  asymptotic  normality  of  I)(u)  -  I)(u)  for 
fixed  { u^ ,  i=l,  ....  k}  ,  which  we  use  to  select  an 
underlying  fQ  ; 

(6)  deriving  a  model  for  Ag(u)  =  Q^Cu)  -  Qx(u)  ,  the 
differences  at  each  quantile,  u  ; 

(7)  giving  estimation  formulas  for  and  o2  -  cr^ 

simultaneously  based  on  Q  and  Q  ; 

I  A 

(8)  finding  the  asymptotic  distribution  for  Ag(u)  ■ 

Qy(u)  -  Q^(u)  at  fixed  u  ;  and 

(9)  providing  graphical  comparison  techniques  via  D(u)  -  D(u) 
and  Aq(u)  -  Ag(u)  • 


V  . 


We  emphasize  that  all  the  theoretical  contributions  in  this 
work  are  based  on  a  location  scale  difference  of  two  independent 
random  samples  with  an  assumed  underlying  f  family  common  to  both 
populations . 
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2.  STATISTICAL  INFERENCE  BASED  ON  D(u) ,  8,  and  <(/ 

In  Chis  section  we  present  the  basic  method  for  making  Inferences 
about  two  populations  given  random  samples  from  each  which  are  in¬ 
dependent.  We  assume  the  two  populations  and  samples  are  as  given 
by  definitions  in  section  1.1.1.  In  section  2.1  we  outline  the 

results  of  Parzen  (1961,  1967)  which  provide  the  theory  to  suggest 

'  -  '  y2~Ul  °2~°1 

the  estimators  8,  4i,  and  D(u)  for  9  =  - ,  <t>  =  - ,  and  D(u)  = 

°1  °1 

F[H  1(u)]  also  defined  in  section  1.1.1. 

In  section  2.2  we  use  these  results  to  obtain  the  computational 

A  A  A 

formulas  for  8,  ,  and  D(u)  which  lead  to  the  relationships  given 

earlier  (Table  la/b,  p.  9)  for  several  density  functions  fQ .  In  section 
2.3  some  large  sample  distribution  theory  for  the  estimators  ob¬ 
tained  in  section  2.2  is  discussed. 

Since  the  methods  of  section  2.1  through  2.3  do  not  apply  for 
all  choices  of  fQ,  in  section  2.4  we  show  how  we  may  use  truncated 
estimators  using  formula (1 .13)  for  the  particular  case  of  the  two 
parameter  exponential  fQ.  Finally,  in  section  2.5  we  describe 
some  finite  sample  size  distributional  results  for  the  estimators 
of  sections  2. 1-2. 4. 

2.1  Time  Series  Regression  and  Preliminaries 

Using  the  definitions  in  section  1.1.1,  Parzen  (1980)  has 
suggested  a  model  for  D(u)  which  we  use  to  obtain  estimators  of 


,  i>  = 


and  D(u).  The  model  is  particularly  sug- 


0 


V2~Vl 


a2~°l 


"1  "1 

gested  when  9  and  ij>  are  near  zero,  i.e.,  the  remainder  terms  of  a 
Taylor  series  expansion  are  small.  For  D(u)  =  FH  ^"(u)  when  0 
and  p  are  small,  Parzen  (1980)  suggests  that  we  may  use 


D(u)-u  =  ( 1-1)  [ 0  fQQo(u)+iM5o(u)  f^u)  ]  +  (£-)**  B(u)  . 


x-u, 


This  model  briefly  is  a  Taylor  series  expansion  of  F  about  (- 


when  0  and  • p  are  small.  A  sketch  of  Parzen's  justification  for 


D(u)-u  =  (1-1)  [0  fO(u)+*Q(u)  f0(u)] 
o  o  o  o  o 


is  given  below. 


Derivation  of  Parzen’s  Representation  for  D(u)-u 


Since  0o^  *  y2~yl  an<1  °i  ^  +  ^  =  °2*  we  have 


x-p_  x-u.-Oa. 

.  G(x)  =  F  (-~ )  -  F  (  t  ()  . 

o  o  2  o  0^(1-^) 

-1  .  2 

Since  (l+iji)  =  1-tMwhemJ;  is  small)  and  0i{>  =  0,  we  have 


x-u. 

G(x)  i  F  ( (— — -)  (1-ip)  -  9] 
o 


F  ((— -)  -  [9  +  *(-—■ i)]} 
0  °1  al 


A  linear  Taylor  series  expansion  of  this  representation  of  G(x) 
X"W1 

about  (— — )  gives 


x-y  x-u  x-u, 

G(x)  *  v-sp  +  fo(^7>  f- 6  - 


Substituting  this  in  H(x)  =  XF(x)  +  ( 1—  X )  G(x)  gives 


H(x)  =  XF(x)+(l-X)F(x)-(l-X)f  ( - -)[0+«K - -)] 

o  ox 

X-U .  x-u 

=  F(x)-(1-X)f  ( - -H0+*( - -)]  . 

0  °I  CT1 

Letting  x  =  H  *(u)  and  rearranging  terms,  yields 

H_1(u)-u  H_1(u)-u 

D(u)  =>  u  +  (1-X)  fQ( — - - -)[8+*( - ^ - -)]  . 

if-^-u.  f"1(u)-u1  ,  , 

Since  [f  ( - - - )-f  (— - - )]3  and  [F  (u)-H  (u)]  i|i  are  an  order 

O  0^  o  0^ 

smaller  than  0  and  ip,  as  6  and  ip  go  to  zero  we  have 

F_1(u) -U,  F_1(u)-U. 

D(u)  -u  i  (  1-X)  f  ( - - - -)[©+*( - ±)] 

o  o^ 

*  (1-X)  f  Q  (u)  [9+*  Q  (u)  ] 

0  0  o 

1-X  4 

The  error  term  of  (~y~)  B(u)  that  Parzen  suggests  is  adopted  from 
Theorem  4.1  of  Pyke  and  Shorack  (1968)  with  the  constants  of  their 
Lemma  3.1  and  equation  3.7.  We  give  the  result  we  need  in  Theorem 
2.1. 


Theorem  2.1:  If  the  conditions  of  Theorem  4.1  in  Pyke  and  Shorack 
(1968)  hold  and  F(x)  =■  G(x)  for  F  and  G  as  defined  in  section  1.1.1, 


then 
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T  1_X  U 

/n  [0(u)-u]  *  (-j~)*  B(u)  , 

0 

where  X^  =  ^  "*  ^Q(0  <  Aq  <  1)  as  N  +  •"  and  B(u)  is  a  Brownian  bridge. 

Proof :  Pyke  and  Shorack  (1968)  define 

Lj,(u)  =  /N  [D(u)-D(u)  ]  =  /5  {F[H~L(u))  -F[H‘1(u)]}  , 
and  show  for 


V  (u)  =  Ln(u)  -  I-^1  ’ 


=  0  ,  0  <  u 


r  a  .  s  . 

that  p(L^,  L^)  0,  where  p  is  the  uniform  metric,  and 


L  '  (u)  *  L  (u)  . 
N  0 


Under  H  :  F  ■  G,  we  have 
o  ’ 


Lq(u)  =  (1-Xq)  B1(u)  -  (1-^)”^  B2(u)}  , 

where  B^(u)  and  B^(u)  are  independent  Brownian  bridges.  As  in  our 
Theorem  1.1,  we  have 

L0(u)  *  (1  -  XQ)  (c  B (u) }  , 

where  c2  -  X_1  +  (1-X  )_1  -  [X  (1-X  )  ]_1 
o  o  o  o 

Therefore, 

(1-X  )  1-X  , 

Lo(u)  *  "V--2 - U  B<u)  “  »(u)  • 

°  X *(1-X  Xo 

o  o 
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and 

„  1-X  . 

[D(u)  -  u]  -*■  ( - B(u) 

X 

o 

We  note  that  although  this  error  term  is  onlv  shown  to  h e  co*~rect  when 
9  *  ip  ■  0,  we  shall  assume  that  it  is  aoDroximatelv  correct  for  fl  and 
ip  close  to  zero. 

That  is,  we  use  the  Parzen  (1961,  1967)  results  to  obtain 

estimates  of  9  and  \p  under  H  :  9  -  =  0  which  we  assume  will  be 

o 

useful  for  9  and  4i  near  0  also.  Exactly  under  what  conditions  this 

is  justified  is  an  open  research  problem.  One  can  calculate  estimators 

for  all  continuous  f  Q  and  Q  (f  Q  )  in  the  RKHS  of  B(u)  (see 

o  o  o  o  o 

section  1)  with 


<f  ,f»>  -  lim  <f.  ,f,> 

P-0  f,q 

q-1 


where 


<fL,f2>  -  J  q  f{(n)f  ’(u)du  +  if1(p)f2(p)  +  -ji-f  (q)f2(q) 

p.q  p  v  M 

The  conditions  of  Lemma  2.1  are  sufficient  for  the  estimation  of 

9  and  <p  using  Parzen  (1961,  1967).  This  gives 


(1  -  X)  (®)  *  E_1  £ 

* 


where 


l  - 


<f  Q  ,f  Q  >  <f  Q  ,0  (f  Q  )> 
O  O  0  0  o  o  o  o  o 


.  <Qn(foQr.)*fnQn>  1 M  M  >  J 

L  O  o  o  oo  ooo  000  J 


\ 


V 
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and 


1 


<f  Q  ,D(u)-u> 
o  o 


<VW-D(U)-U> 


as  in  section  1.2, for  a  solution  to  the  normal  equations 


(1  -  *)  E(!)  =  £ 

'll 

The  estimators  0  and  then  give 

D(u)  =  u  +  (1-A)  [9  f  Q  (u)  +  (/  Q  (u)f  Q  (u)  ]  , 

oo  o  o  o 

using  Parzen's  model  for  D(u) . 


2.2  Calculation  of  0,  ifr,  and  D(u) 

In  this  section  we  give  some  lemmas  useful  in  calculating  E 

and  jj  for  various  fQ.  We  then  calculate  9  and  ip  for  seven  different 

f  densities, 
o 

We  consider  here  the  following  fQ(x);  (  -  ®  <  x  <  <*>  ,  unless 
otherwise  specified) 

1  -x^/2 

Normal  f  (x)  =  -  e  *  »  F  (x)  =  $(x) 

0  sTir  ° 

• 

Logistic  f  (x)  »  ex(l  +  ex)“2  ,  F  (x)  =  (1  +  eV1  . 

o  o 
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Cauchy 


-1  2  -1 
f  (x)  -  ff  A(l+x  ) 
o 


F  (x)  =  ^  +  —  tan  1(x) 
o  l  w 


f  (x) 
o 

Double 

Exponential 


lx  _ 
j  e  ,  x< 0 

_1  e  Xx>0 
2  ’ 


F  (x) 
o 


lx  A 

2  ®  »  x<_0  i 

1  -  j  e-X,  x>0  . 


"Ansar i- 
Bradley" 


t  F  (x) 

fQ(x)  =  |(l+|x| ,  ° 


1  ,-l 

=  -(l-x)  , 


x<0 


\  +  \  (l-(l+x)~1],x>0. 


"Quartile" 


f  (x) 
o 


16x 


ixli  i  * 

mHj  * 


F0(X)  "*  Ita*  X-'i  ’ 

=>  -|  +  x,  x  E  , 


=  1- 


1  ^  1 
16x  ’  X  4 


Exponential  fQ(x)  =  e  *  x  >  0  ,  FQ(x)  =  1  -  e  ,  x  _>  0, 
=  0,  x  <■  0,  =  0,  x  <  0  . 


-x 


The  formulas  for  I  and  g  require  several  inner  products.  The 
calculation  of  these  inner  products  can  be  simplified  for  many  f 
by  using  the  following  lemmas. 

Lemma  2.1:  If  f  Q  and  Q  (f  0  )  are  L»  differentiable  functions, 
-  O  O  0  0  0  l 

then 


when 


(1)  <f  Q  ,f  Q  >  -  /  1  J  2(u)du 

o  o  o  o  '0  o 


[f  Q  (P)]2  [f  Q  (1-P)12 

(i)  0  -  lim  —2~2 -  -  lim  — — - 


p-*0 


p  *0 
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(2)  ‘VW1  V  W>  ■  lo  11  -  'Jo(',)Jo(u)l2  du 


when 


(ii)  0 


tQo(p)Wp)1  [Qo(1-p)f  Q  (l-p)]‘ 

lim  -  _  lim  — — — - 


p-*-0 


p-*-0 


<3)  <foV  VW>  '  /0lQo(u)Jo2(u)du  -  /Oljo(',)du 


when  (i)  and  (ii), 


m  R. 


(4)  <fQQ0»  D(u)-u> 


■  /o1  Vu)du  -  s  l  ViST> 


i-1 


when  (i),  and 


(5)  <Qo(foQo),D(u)-u>  -  I,,1  Qo<u)Jo(u)du-i  "  Q04r>VSI) 

i“l 


when  (ii) . 


Proof:  [Adapted  from  Parzen  (1979,  1980)] 


(1)  By  definition,  J  (u)  ■  -  (f  Q  )  (u)  which  gives 

o  o  o 


<f  Q  ,f  Q  >  =  lim  <f  Q  ,f  Q  > 
o  o  o  o  ^  o  o  ovo 

q-*-l 


p.q 


o  (f0Q0(p)l2  UQ(q)]2 

nmlPl-J  (u)]‘du+-2-4 - +  ——■ - ] 


p-*0 

q-1 


'p  o 


1-q 


/q1  Jo2  (u)du 


(2)  Since  qQ(u)  ■  l/fQQo(u),  and 

[Qo<u)fo,.<u)1'  '  + 

m  -Q  (u)J  (u)  +  f  Q  (u)q  (u) 
0  0  o  o  o 

-  1  -  Q  (u)J  (u)  , 
o  o 

<<Uf«(U»<Ufn(U>  ”  In  [l-Qn(u)J  (u)]2  du 

OOO  OOO  0  O  o 

when  (ii). 


(3)  Similarly, 


<f  Q  ,Q  (f  Q  )> 
OOO  o  o 


!q  [-Jq(u}] [1-Qo(u)Jq(u) ]du 

Iq  Vu)J02(u)du'_  Iq  Jo(u)du 


when  (i)  and  (ii) . 


(4)  Similarly,  since  D(l)  *  l  and  D(0)  *  0, 

<f  Q  ,D(u)-u>  «  /  1  [-J  (u)]d[D(u)-u] 
o  o  '  0  o 

-  L1  J  (u)du  -  f1  J  (u)d  D(u) 
'0  0  0  o 

m  R. 

I  J 

i-1 


m  K 

/nl  J  (u)du  -  -  l  J  (— )  , 
'0  o  m  o  N+l 


1  i 

recalling  that  D(u)  has  jumps  —  at  u  ■  tttt 

m  Prrl 
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(5)  Also, 

<Q  (f  Q  ),D(u)-u>  =  /  1  {1-Q  (u)J  (u)  ]d[D(u)-u] 
o  o  o  u  o  o 


/  1  dD(u)-fo  Qo(u)Jo(u)dD(u)-l +/q  Qo(u)Jo(u)du 


i  .  ui  w  , 

/„  <(>»„<«)*.  -  -  [ 


1=1 


ovN+r  ovn+r 


Remark:  The  tail  conditions  of  Lemma  2.1  essentially  are  the  conditions 

needed  besides  L„  differentiability  for  f  0  and  Q  (f  0  )  to  be  in  the 
2  o  o  o  o  o 

RKHS  of  B(u)  for  p  =  l-q  =  0.  To  make  this  clear  is  why  we  include 
i omma  2.1.  This  lemma  is  used  to  show  9  and  are  linear  rank  statistics . 


Lemma  2.2:  If  in  addition  to  fQQo  and  Qq ( fQQo)  being  L2  differentiable, 
we  have  that  fQ  is  symmetric,  then 


(1)  <fQQ(fQ)>  =  0 

o  o  o  o  o 

when  (i)  and  (ii)  of  Lemma  2.1  and 


1  m  Ri 

(2)  <fnQn,D(u)-u>  -  -  - 


when  (i)  of  Lemma  2.1 


Proof :  Since  fQ  symmetric  is  equivalent  to  fQQo(l-u)  =  fQQ0(u)  or 

J  (1-u)  =  -J  (u)  or  Q  (1-u)  =  Q  (u)  and  Lemma  2.1  holds,  we  have 
o  o  o  o 


«>  <foVVW’  ■  /0lQo(u)Jo2(")du-/oljo(u>d" 

‘  ^0!|,o<u)Jo2<u)d',+^0i  Vu,du 

-/o!«o<u)Jo2<u)du-JoJo(,')du 

=  0  , 
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and 

(2)  <f0Q0,D(u)-u>  -  -  i  ha<&>.  »i»«  /0lJo(«)d«  ’  0. 

Theorem  2.2:  If  f  0  and  0  (f  0  )  are  in  the  RKHS  of  B(u)  with 
'  o  o  o  o  o 

p  «  1  •  q  •  0,  F  «  G,  and  f  is  symmetric  with  the  tail  conditions 
of  Lemma  2.1,  then 

m  g 

a-»)9-  i/oVo*!'1!-  ; 

i=l 

and 


<1-*)*  -  {/01[l-Qo(u)Jo(u)]2du}‘1[/01Qo(u)Jo(u)du 


m 


~  ^  Qo*N+l^  Jo(N+1^  1 


i-1 


Proof :  Since  Lemma  2.1  and  2.2  give  the  terms  of  Z  and  £,  we  have 
(1-X)(?)  =  Z  *  £  as  given  in  Theorem  2.2. 

Note  that  our  Z  for  the  two  sample  case  is' the  same  as  the  one 
sample  Z  in  Parzen  (1979)  and  Eubank  (1979).  In  order  to  carry  out 
the  estimation  of  9  and  tji  as  given  in  the  above  theorem,  we  need 
foQQ(u),  Qq(u),  and  JQ(u)  for  each  density.  They  are  given  in  Table  2 
We  obtain  the  results  in  Table  2  for  the  normal,  logistic, 
and  Cauchy  densities  as  in  Parzen  (1979)  and  Eubank  (1979).  The 
others  are  also  obtained  by  using 

Q_(u)  -  F  _1(u) ,(f  Q  ) (u)*f  (F  _1(u) ]  or  f  Q  (u)-l/Q  * (u) 

O  O  OO  00  oo  o 


or  J  (u)  -  -(f  Q  ) ' (u) 
o  o  o 


t 
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2.  Densltv-Ouant ile.  Quantile  and  Score  Functions 


f 

f  Q 

Q 

J 

0 

0^0 

0 

o 

Normal 

1  -lil*'1(u)|2 

-  e 

*_1(u) 

*  1(u) 

/Jtt 

Logistic 

U  (1-u) 

.  U 

108 

2u-l 

Cauchy 

—  sin2(iru) 

7T 

tan  [  ir(u— j)) 

-sin(2wu) 

Double 

1 

u  ,  u  _<  - 

log  2u  ,  u  ^  y, 

1 

,  1 
-1»  u  <  J  , 

Exponential 

1-u,  u  ^  - 

-log2(l-u) ,  u  _>  J 

1.  u  >  - 

Ansari- 

,  2  <  1 

2u  ’  u  -  2 

.1-1  <  1 

1’2U  *  U-  2 

-4u,  u  <±, 

Bradley 

density 

2  (1-u)  2 ,  u  >  y 

-1+yd-u)"1,  u>y 

4  ( 1-u)  ,  u  >y 

,,  2  1 

16u  ,  u  <  -  , 

1  -1  <  1 
"ii”  *  u  "  4 

-32u,  u  <  -  , 

Quartile 

0  •  «  «  (J.f)  • 

i  t1  3^ 

u_  2  ’  ue(4*4)8 

o.  «<H>’ 

density 

16U-u)2,  u 

ied-u)"1*  u  >| 

32  (1-u) ,  u  >  ^ 

Exponential 

A 

-i 

1  -  u 

log  (1-u) 

1 

*Not  in  RKHS  of  Brownian  Bridge  process  for  p  ■  1  -  q  «  0 


3.  Computational  Formulas  for  0  and  </> 


Density 

(!-')  9 

(1-X)  1 1) 

Normal 


m 


— E  [ ^ — )  I  2 
2  2m  1  lN+r  1 


Logist ic 


3  -  -  (~i-) 

m  N+l; 


-(fr^iog^)  i2(^)-i] 

3+ir  i 

i_N+l 


,N+1 


Cauchy 


m  z  sin(2ir 


hh  EsinI2*(Nir)]tan[*(¥h:- 


Double 
Exponent ial 


i  E  sl»nf?  -  sii1 


R  R 


"AnsarL-  3  _  ,Ri  .  Ri  1  Ri  -  12  Ri  1  1  Fi 

Bradley"  m  E  min(STT>1  “  STT}  sl«n(T  *  34  ~  r  <STT  ~  T)si8n(T  “  V+T 


N+l  N+l 


'2  N+l  m  N+l  2 


l2  N+l 


"Quartile' 


r R.  «  ■»■  R.  . 

L  (^}  +  H  ^  j 

r»  ^  ~  /%t  I  1  \  n  .  J  /»f  .  i  \  ®  n  L  r  ^ 


Rj  <  ^(N+i)  Rt  >  |(N+1) 


R^f-^N+l)  ,^(N+1)] 


1  +  i  -  5n> 


Exponential 


(Not  covered  by  RKHS  Theory) 
(Assumes  9  knovm) 


30 


which  yields 


t  m  ,  R. 

0  =  -  i  7  <^—  ^  ( — i — ) 

m  N+l; 

i=l 


1  1  “  ,  R.  a 

♦  -  i  -  =  j^"1^”2 


(Logistic)  Again,  since  the  tail  conditions  hold,  we  have 


:f  Q/„Q>  «  L1  (2u-l)  2du  =  ~  . 

o  o  o  o  0  3 


By  symmetry  of  fQ,  <^0Q0 .Q  (fQ0  )>  =  0. 


By  Eubank  (1979), 


‘VW’VW”  !o 


Then, 


3  0 


For  £,  we  have 


~  1  -  ?  -i 

<W(u)-u>  -  -  f0L  (2u-l)d[D(u)-u]  -1-J  [  (^)  , 

i»l 


<Qo(foQo),°(u)"U>  -  /01(l-(logi^)(2u-l)]d[D(u)-u] 

m  R  R 


\ 


which  yields 


and 


*  ,  m  R . 

e  ‘  3-i,V>si> 

i=l 


„  q  1  m  R-  R-  R- 

♦  =  ~  l  2^~  -1] 


3-Hr 2  m  i=l 


N+l 


"N+l 


(Cauchy)  Since  the  tail  conditions  hold,  we  have 

<f  Q  ,f  Q  >  *  fj"  sin^(2iTu)du  =  . 

o  o  oo  1 0  2 

By  symmetry  of  f  ,  *f  Q  ,Q  (f  Q  )>  =  0 
J  J  O  o  O  O  0  0 

and 

<Qo^foQo)  ,Qo(foQo):>  =/01[1  +  sin(2lTU>tan1T(u  -  f) 

Then, 


For  j>,  we  have 


<foQ0,D(u)-u>  =  i  l  sin[2w(^-)] 


i=l 


N+l' 


and 


<Q0(f0Q0),D(u)'U>  =  1  +m  l  Sin^27w>tan&r(i5ir 


m 


i=l 


N+l  N+l 


which  yields 
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and 


2 

5m 


l  si«[2*<^)Han[n(^  -  \) ) 
i=*l 


(Double  Exponential) 


<f  Q  ,f  Q  > 

0  0  0  0 


*  1  -*■  lim  p  +  11m  (1  -  q) 
p-K)  q-*l 


Again,  by  symmetry 


1  . 


T  2  2  ~  ^ 

<f  Q  ,Q  (f  Q  )>  -  O+limip  iog  p+  lim— (1-q) 

°  °  ooo  fH-0  ?  p+0  q 


‘VW-VW’ '  +  !h  u"c 

+  lim  -  [ploR2p] 2+lim  y 

p-*-o p  q-"1 


=  2  /^(l+log2u)2du-21im  p 
0  p+0 


Then, 


Lo  l  J 


For  £,  we  have 


<foQo,D(u)-u>  -  !q  d[D(u)-u]  +  (-l)d[D(u)- 


-  lim  (1-q) [D(l-q)-(l-q)l 
<ri 


m  . 

I  3i8n  [j  ~ 

i-1 


i_ 

N+l 


!+11"l^'£o5oU)!2 

q-’-i 

2log2 (1-q)  =  0. 

I  (u)]2du 
o 

— [(l-q)log2(l-q)] 

-q 

-2  lim  p  *  1  • 

p-*-0 

u]  +  lim  p[D(p)-p] 
p-K) 
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Next , 


< Q  (f  Q  )  ,D(u)-u>  =  /.i(l+log2u)d[D(u)-u]+  jf1  [l+log2(l-u)  )d[D(u)-u] 
ooo  ‘  u  *5 


+  lim  p  log  (2p)  [D(p)-p] 

p-0 

4-  lim  {  (l-q)log[2(l-p)]  (D(q)-q) } 

<r-i 


.  m  R.  R 

=  -  [log  2[min(^  ,  1  -  m)] 
i=l 

which  yields 

.  ,  m  R. 

1  r  r  1  i  i 

9  =  m  ^SXSn  l2  -  Wn] 

and 

*  =  m  X108  2[rain(Nil  *  1  ~  N^I)1  * 


("Ansari-Bradley") 


<f  Q  ,f  Q  > 
oo  o  o 


j'j  (-4u)2du  +  /,L  [ 4 ( 1-u)  ] 2du  +  lim  ±  [2p2]2 

0  ^  p-*-0  P 


+  lim  t~“  [ 2 ( 1-q) 2 ] 2 


q->l 


1-q 


2  2  4 

3  3  3 


Again,  by  symmetry  of  fQ, 


<Qo(foQo)>foV  “  0  +  “o  P  (2p2)2(1  -  ip) 

+  lim  2 ( 1-q) 2 } 2 -11-0 


V 


V 
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Next, 


:Qo(foQo),D(u)"u>  =  A)lQo(u)Jo(u)du  '  /olQo(u)Jo(u)d  °(u) 


+  lim  ^(1  -  ~)2p2[D(p)-p] 
p-KD  P 

+  Urn  -D2(l-q)2[D(q)-q] 


1  +  m  J  sign(2  '  N+1)(N+1  ~  2 ^  ’ 

i=l 


which  yields 


,  m  R.  R  R. 

-  J^min  *  1  "  sl®n  ~  jj+i* 


and 


»  1?  m  R.  R 

*  =  3  +  l  ^  ~  7>  sl8"  <7  -  • 


m  1^1 'N+l  2' 


("Quartile") 

<foVfoV  =  / o1  J02(u)du  +  lim  ^(16)2  p4  +  llm  it(16)2d-q) 

p-*-0  F  q-*-l  • 

By  symmetry. 


<fQ,Q(fQ)> 
o  o  o  o  o 


1  2  2  11 
0  +  lim  ~( 16p  )  1- ~  )— 

p+o  P  16  P 

+  lim  -p-  [  16  ( 1— q) 2  ]  ^-0. 

q-*!  1_<»  16  1“<? 


4  32 

“  3  • 
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Next, 


<VW-VW: 


/o‘[l-(-r5u'l>,‘32ul|2du 


+  /im4 


+  /3/ 4 


16 


+  lim  ~[(~7T^P  ^16p^] 
p-K)  P  16 

q->-l 


1  +  0  +  0  =  1 


Then, 


-1 


32 

3 


0 

1 


-1 


For  £,  we  have 


<f  Q  ,D(u)-u> 
o  o 


_  1  2  "* 

[  1  -J  (u)d(D(u)-u]  +  lim  -r  16P  [D(p)-pl 

0  0  p+0  P 


32 


+  lira  -ri-  16U-q)2  [D(q>-ql 

q*l  l"1 


T&-» 

R^CN+l)  R  >7(N+1) 

1  4 
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Next, 

<Qo(foQo),D(u)-u>  =  ~! q  dD(u)  -  /3/4dD(u) 

+  lim  ;[D(p)-p](-^)  ^  16p2 

r+o  P  16  P 

+  “j  _i_[D(q)_q]  16(1_q)2 

*  -  -  y  i 

m  Rii[k(N+l),3/4(N+l)]  , 

which  yields 


Ri 

0  =  -  ^(N+1) 

”  Rt<!s(N+l) 


and 


ip  =  -  -  y  i 

ra  R1|[V(N+l),3/4(N+l)] 


(Exponential)  Here  we  use  fQ(x)  *  e  X,  x  >  0,  and  assume 
0*0  since 


Here,  *  E(X)  and  a ^  =  E(Y). 

°2~°i  a2  °2 

Remark:  Since  it  m  -  ,  we  have  it  +  1  =  —  .  We  note  —  is 

-  0j_  01  Oj 

the  ratio  of  scale  parameters  which  is  often  studied  by  researchers 
(for  example,  the  F-test,  Siegel-Tukey ,  Ansari-Bradley ,  etc.  and 
more  recently  by  Bhattacharyya  (1977)).  Thus,  the  estimators  and 
tests  given  here  for  it  also  provide  results  which  may  be  used  for 
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the  ratio  of  scale  parameters,  — 

al 


2.3  Test  Statistics  and  Confidence  Intervals 

In  this  section  we  provide  the  asymptotic  distribution  of  9, 
!</,  and  D(u)  at  fixed  u  given  in  Theorem  2.2  and  2.3. 

2.3.1  Inferences  about  0  and 

Parzen  (1980)  provides  the  joint  asymptotic  distribution  of  0 
and  i|>  in  the  following  result.  The  proof  of  this  fact  is  essen¬ 
tially  also  given  in  our  method  of  oroof  in  Theorem  2.5. 


Theorem  2.4:  If  f  Q  and  Q  (f  0  )  are  in  the  RKHS  of  B(u)  with 
-  o  o  o  o  o 

A,  A 

p  =  i  _  q  =  o,F=G  and  0  and  <(/  are  as  given  in  section  2.1,  then  as 

=  7}  -*•  X  (0  <  X  <  1)  and  N  -*■  ®  ,  we  have 
N  N  o  o 


&  (.  )  °  N,[(°),  y  1  I"1]  , 


'I'-'l' 


21'0' 


where  y  =  A  (1-A  )  and  Z  is  as  in  section  2.1. 
o  o 


Remark:  Statisticians  often  define  quantiles  with  other  notations. 

For  example,  define  z  by  Z  %  N  (0,1)  and  P(Z  <  z  )  m  1  -  ^  . 

a  —  £  2 

2  2  a 

In  terms  of  the  quantile  function  we  define  z  ■  ♦  (1  -  j)  . 

-1  ^ 

We  also  denote  Z  by  C  =  (c^).  Theorem  2.4  gives  us  the  following 
confidence  intervals,  regions, and  tests  of  hypotheses. 

Note  that  the  confidence  regions  are  proved  correct  for  0  and  zero 
although  we  may  still  wish  to  use  them  when  0  and  1 1>  are  moderate. 


V 


m 


Corollary  2.1:  If  f  is  the  correct  density  and  D(u)  is  Riven  by  (1.9) 

then  for  y  *=  \  ( 1- X  ),  we  have 
o  o 


(1)  A  (1  -  a)  100%  asymptotic  confidence  interval  for  9 


9  *  *  . 

a  Ny 


(2)  A  (1  -  a)  100%  asymptotic  confidence  interval  for  ip 


c  9  2  h 
\ l)  +  z  (———)  ^ 
VNy  1 


(3)  A  (1  -  a)  100%  joint  confidence  region  for  9  and 
is  given  for  fQ  symmetric  by 

'  C11  4 

9  ±  z 

a  Ny 


;  .  ,  22, ld 

<l»  ±  Z  (- — ) 

a  Ny 


Corollary  2.2:  For  D(u)  given  by  (1.9)  a  test  statistic  for 

H  :  9  =  i|>  =  0  is  the  quadratic  form 
o 


L  -  Ny(9) •  I(?)  2-  x2(2) 


Remark:  C  «  Z  1  »  (c..)  was  calculated  for  several  f  in  the 
-  ij  o 

preceding  section  and  c^  and  are  given  In  the  following  table 
for  convenience. 


4 .  diagonal  Eluents  of  Liaitlne  Covariance  Metrlx 


f 

0 

cu 

C22 

Normal 

1 

1/2 

Logistic 

3 

9/  (3+ir2) 

Cauchy 

2 

2/5 

Double  Exponential 

1 

1 

"Ansari-Bradley” 

3/4 

3 

"Quartile" 

3/32 

1 

Exponential 

(assume  known 
location 
parameter 

9  -  0) 

1 

If  one  did  not  trust  the  D(u)  model,  then  a  nonpar ame eric  test 

of  H  :  F  ■  G  or  H  :  D(u)  *  u  may  be  constructed  from  the  distribution 
o  o 

sup 'N  |D(u)-u|~sup  c|B(u) | .  Durbin  (1973)  gives  expressions  for  the 
distribution  of  suplB(u)!  which  suggest  teat  statistics  which  do  not 
depend  on  a  parametric  model  for  D(u) .  One  may  thus  use  a  test  based 
suprtf! D(u) -u I  as  a  diagnostic  when  comparing  two  samples. 

Using  D(u)  and  the  asymptotic  distribution  of  3  and  we  may 
find  approximate  confidence  intervals  for  D(u)  when  u  is  fixed  and 


42 


our  model  for  D(u)  is  correct. 

2.3.2  Confidence  Intervals  for  D(u) 

From  the  asymptotic  distribution  of  D(u)  given  in  Theorem  2.5 
we  may  obtain  confidence  intervals  for  D(u)  at  specified  u.  First, 
we  give  two  useful  results. 

Lemma  2.3:  (Brown  (1970)  ,  Corollary  3.1)  For  two  square  integrable  functions 
f ^(u)  and  f^(u)  on  Oj^u^l  and  in  the  RKHS  of  B(u)  for  p  ■  1  -  q  *  0, 

E [/q1  f1(y)dB(y)/01  f2 (y)dB(y) ]-/Q1[ f L (y) -/0Lf : (u)du] 

•  [f,(y)-/0lf2(u)du^dy  • 

Lemma  2.4:  For  f  Q  and  Q  (f  Q  )  in  the  RKHS  of  B(u)  with 
-  o  o  o  o  o 

p  -  1  -  q  -  0  , 


and 


wx(y) 


/0ljo2 3(u)du 


w2(y) 


1-Q0(y)J0(y) 

/01[l-Qo(u)Jo(u)]2du 


(1)  /  1  W  (y)dy  *  0  » 

0  1 

(2)  /  ^  W_(y)dy  ■  0  ,  and  for  f  symmetric  also  > 

0  0 

(3)  /  1  W  (y)W  (y)dy  -  0 

0  1 


i 


we  have 
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Proof : 


(1)  /rtlW, (v)dv  =  [  ■  *J  "  ( u  3  d  u  1  1  i  (v3dv  =  0  , 

()  1  ‘  '  0  o  '  u  o  ’ 

since  J  (u)  =  -J  (1  -  u)  . 
o  o 

(2)  /  Sv  (v3dv  =  [[  l[l-Q  (u3  J  (u)  ]  “du  )  1 /  1 [ 1 — Q  (y)J  (v)]dy 

gd  (I  O  O  U  o  o 

=  0  , 

s  ince 


1  [  1  -Q  (v)J  (v) ]dv  =  J.^tQ  (u)f  Q  (u) )  du 
)  O  O  '  0  o  o  o 


=  { 0  (u) f  [ 0  (u)]} 
0  0  0 


1 

0 


0  , 


since  0  (f  Q  3  is  in  the  RKHS  of  B(u)  with  p  =  1  -  q  »  0  . 
o  o  o 


(3)  /01W1(y)W2(y)dy  = 


/./[-•J  (u)  ]  [  1-Q  (u)J  (u )  ]du 
'  0 _ o  o _ o _ 

(/-/j  ‘‘(u)du)  (/  *  [1-0  (u)J  (u)]2du) 
;  0  o  0  o  o 

0 


since  J  (u)  «  -J  (1-u)  and  Q  (u)  =  -Q  (1-u)  . 
o  o  oo 

In  the  following  theorem  we  give  the  asymptotic  distribution 
of  D(u)  under  the  null  hypothesis,  F  =  G. 


Theorem  2.5:  If  f  is  symmetric,  f  0  and  0  (f  0  )  are  In  the  RKHS 

-  o  '  o  o  o  o  o 

of  B(u)  with  p  =  1  -  q  =  0,  the  conditions  of  Tieorem  2.1  hold,  and 

F  »  G,  then  as  N  *  ■*>  such  that  ( 0  <  X  •  13 ,  we  have 

N  N  o  o 
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L  o’  1 

vN  [  D(  u)  -  ul  *4’(— - )^t'oQo(u)/o  W^(y)dB(y) 

o 

+  n  (u) f  0  (u)  /  V  (y)dB(y)  } 

o  o  o  '  0  2 

=  (u) 

which  we  call  "the  Brownian  bridge  representation  of  D(u)",  where 
W^(y)  and  W^lv)  are  given  in  Lemma  2.4. 

Proof :  By  definition  of  D(u>  we  have 

>N  i  D(u)-u )  =  >N(1-'H“  f  t(u>  +  -j  f  (U)l  , 

where  fOu'i  =  f  ')  (u)  and  fj'i)  =  Q  (u)f  0  (u).  Also  (by  definition 
]  C  O  -  o  o  o 

of  ind  .1 

(!-■'  ■  ^  n'*  (vVd(O(v)-y  ] 

and 

(1-0  .  -  '^W,  f  y  )d[  D  ( yd  -  v  ) , 

where  W,  and  W,  are  defined  in  Lemma  2.4,  a  and  are  given  in 

i. 

Theorem  2._.  ’''his  elves 

.  N  [  TW  u  i  -  u  1  =  1  ,  1  o  1  ;  (  v  id  f  .  N  (  D(  v)  -v  )  ]  +  f  (  u)  jq'-^  (y)d[*^HD(v)-y)l 

=  '  [  !  '  •  i ' u ,  !  v  )  f  ,  (  u  H*\  ( v  1  ]d[*N(D(v)~y)  )  . 


v 
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Since  f..(u)W  (y)+f  (u)W-'(y)  is  L„  and  »^(D(y)-y)+  c  B(y)  , 

1  1  L  L  i. 

1-X  , 

O  ^ 

where  c  =  (— — )  ,  we  have 
o 

t/N[D(u)-u]  ^/01[f1(u)W1(y)+f2(u)W2(y)]  dcB(y) 

=  c{  f^Cu) /Q1W1(y)dB(y)+f  2(u)  J0'LW2(y)dB(y)}  . 


This  gives, 

•fi[D(u)-  u]  h.  (^r2)ii{foQo(u)/01W1(y)dB(y)-H)o(u)foQo(u)/01W2(y)dB(y). 

o 

From  this  representation  we  are  given  the  asymptotic  distribution  of 
D(u)  . 


Remark:  Although  we  state  the  theorem  under  H^:  F  *  G,  we  hope  that 
for  9^  *  9//n  and  ip^  =  tp/yfi  ,  i.e.,  local  alternatives,  we 

may  expec t a  similar  result.  The  argument  needed  is  exemplified  in 
Lepage  (1975),  Hajek  and  Sidak  (1967),  Chapter  VI,  and  in  many  of 
their  references  but  complicated  here  by  the  error  term. 


Corollary  2,3:  For  the  assumptions  of  Theorem  2.5,  a  (1  -  a)  100% 
asymptotic  confidence  interval  for  D(u)  is 


D(u)  ±  z 

a 

2 


r—  f 
"N 


0 


V 


46 


where  -  /Q1W12(y)dy  and  c^2  *  j * 

Proof :  We  assume  the  result  of  Theorem  2.5  to  obtain, 

1  . 

V[Z1(u)]  -cV{f1(u)/0  W1(y)dB(y)+f2(u)/0  W2(y)dB(y)} 

=c2f  2[Q  (u)HV[J-LW.(y)dB(y)]+Qn2(u) 

0  0  U  i.  o 

•V[/01w2(y)dB(y)]} 

since  W1(y)W?(y)dy  ■  0.  Further, 

nl0\(ymiy))  -  £U/0\(y>dB(y)]2} 

-  /01[W1(y)  -  /QHj1(u)du]2dy 

-  /0V2(y)dy  , 

since  /gHf^(u)du  -  0  and 

v[/olw2(y)dB(y)]  *  EtCVy)dB(y)]2 

-  /01[w2(y)  '  /0S(u)du]2dy 

"  /olw22(y)dy 

Note:  The  values  of  c^  and  for  seven  densities  are  given 

in  Table  4  (p  .  41) .  The  same  densities  have  f  Q  and  Q  given  in  Table  3  (p.  29) . 
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The  distribution  for  D(u)  will  be  used  for  evaluating  the  model  of 

D(u)  and  for  selecting  f  in  section  3. 

o 

However,  we  first  illustrate  estimating  9  and  ^  with 
0  c  p_<u_<q  <  1  and  provide  these  trimmed  estimators  for  the 
exponential  distribution. 


2.4  Truncated  Estimation  for  the  Exponential  Distribution 

In  this  section  we  modify  the  estimation  from  0  ^  u  <_  1  to 

0<p_<u_<l-p<l  for  the  two  parameter  exponential  distribution. 

Since  f  Q  (u)  =  1  -  u  does  not  satisfy  the  left  tail  con- 
oo 

1  2 

dition  that  lim  —  (f  Q  )  (p)  exists,  we  can  not  estimate  0  using 
p  oo  r 

p-+0  y 

all  0  _<  u  <_  1.  The  following  theorem  implements  simultaneous 
estimation  of  9  and  based  on  calculating  truncated  inner  products 
as  defined  in  formula  (1.13)  with  u  truncated  tp  the  interval 
0  <  p  £  u  <_  1  -  p  <  1. 

Theorem  2.6:  For  the  location  and  scale  exponential  density  and 
0  <  p^uj^l-p  <  l,if  we  use  Parzen's  D(u)-u  representation,  we 
have 


(l-AH®,  =  E"1  £  , 


where 


1  -  (oij)  and  ^  =  (gi> 


11 


P 


and 
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°12  "  °21  "  ^  108  (1  ’  P)  » 

°22  "  lo82(l-p)  +  (l~p)  log  (1-p)  -  p  log  p  , 

i  -  -  i  //R< 

gi  •  *7*  D(p)  +  D(1-p)  -  „  £fp»  i-pi)  -  1  * 

and 

1  #R<  ,  R.  ,  . 

g2  "  n  (N+I  E[P*  1-P»+  i  RZ  i0^1-  ^-Dtp)1^  log(l-p) 

N^[P,1-P1 

-D(l  -  p)  log  p  +  log  p  , 
which  are  all  calculable  from  the  data  and  p. 

Proof :  Par2en  (1979,  1980)  gives  reasons  for  using  0<p^u_<l-p 
<  1.  Thus,  we  have 


-/P1"Pj„2(u>du*i<  w2<”> 

*  ■£;  'W2'" 

.  uji-p  +  u=£ii 

IP  P 

.  i  -p+ilzEi! 

P 

»  1=2. 

P 
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Also , 


r  1"P(-i)[l-log(l-u)'1]du 

*  n 


0 21  “  ° 12  “  <foQo*Qo(foQo)>p,i-p  '  P 


+  -(l-p)2log(l-p)*1+^?2lo*^"1) 


/  log  7  dy-l+2p 

_(lr£>I  log(l-p)-p  log  ? 
P 


_  ( 1+ 1^2. )  ( 1-p )  log  ( 1-p) 
v  P 


i^2  log  (1-p) 
P 


Next, 


o22  .  <Qo(u) f0Q0(«) .  f0Qo(u) > 

„  |  L-p[l-log<l-urfd^  log2(l-p)'l(l-P)‘+^lo*‘,P  V 


!  l“pu+i<»g(i-«»)l  du 
'  p 


2  2 
2du  +  lil2lliog2(l-p)  +  P  log  P  , 
? 


where 


+  1'plog2  u  du 
'  p 
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-  l-2p  +  2 [ (1-p) log(l-p)-l+p-p  log  p+p] 

+[ (l-p)log2(l-p)-(l-p)log(l-p) 

2 

+l-p-p  log  p+p  log  p-p] 

2  2 

-  (l-p)log  (l-p)+(l-p)log(l-p)-p  log  p-p  log  p. 

Combining  the  above  expressions  yields 

o22  -  ^  log2 ( 1— p) +( 1— p) log ( 1— p ) — p  log  p  . 

For  ,  we  have 

g,  -  <f  0  ,D(u)-u>  .  -  /  (u)]d[D(u)-u]+if  Q  (p)[D(p)-p] 

1  o  o  p^l— p  p  o  poo 

+  ^  foQo(l-p)[D(l-p)-l+p] 

m  f  1  ^ du-f ^  ^dD(u)+  — ' ^[D(p)-p]+D(l-p)-l+p 
P  P  P 

R 

"  (l-2p)-  ~[<hjrre[p»l-P]  1+  D(p)-l+p+D(l-p)-l+p 

m  N"ri  p 

-  D(p)+D(l-p)-l-  £  t  [p *  1— p] }  • 

Finally, 

g2  ’  <Q0<foQo)»D(u)-u>p.l-p 
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-  /  1-p[l+logU-u)ld[D(u)-u3+  — (l-p)log(l-p)[D(p)-p] 

P  P 

+  ^2.  iog  p  [D(l-p)-l+p] 

P 

R  R 

■  i  f^;ecp.i-pii+  i  I  lost1-  5?t>-<1-2p> 

& . 

jj+l  £  f  P » P 1 

-  /  ^~plog  u  du-  -^-2log(l-p)D(p) 

P  P 

-  (log  p)D(l-p)+(l-p) log(l-p)+(l-p) log  p 

R  R 

-  ^t^etp.l-p]  1+  £  Ilogd-  P-  ~^log(l-p)D(p) 

j^ietpd-p] 

-  log  p  D(l-p)  . 

Corollary  2.4:  For  the  assumptions  of  Theorem  2.6  andvith  o  <  the 
results  of  Theorem  2.6  for  £  simplify  to 

h  -  -1  and  g2  -  1  +  i  l  logd  -  • 

Ri  £[P,1«P) 

N+l 

1  1  *j  ~ 

Proof:  Since  p  <  — ,  we  have  ^or  *■»  D(p)*0,  D(l-p)"l, 

Ri 

and  c[p,l-p])  ■  m,  thus  giving  and  g„  as  desired. 


V 
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One  may  still  use  all  of  the  data  when  using  this  corollary  and 

its  estimates  of  6  and  i(f.  The  asymptotic  approximations  are  obtained 

by  replacing  all  <f,,f  >  with  <f,,f  >  although  the  distributions 
1  i  l  l  p,q 

may  not  hold.  We  also  note  that  the  left  tail  is  where  the  tail  con¬ 
dition  is  not  satisfied  and  that  one  may  desire  to  use  0  <  p  <  u  ^  1 
as  a  basis  for  the  estimation  with  this  density.  Other  than  the 
corollary,  we  offer  no  choice  for  p  at  this  time.  A  topic  of  further 
research  is  to  choose  p  to  minimize  a  criteria  such  as  variance  or 
mean  square  error  of  8  and  4> . 

In  section  2.5  we  give  some  remarks  on  some  finite  sample 
size  distributions  for  0  and  >j>  for  0  u  <_  1. 

A  A 

2.5  Finite  Sample  Distribution?  of  8  and  v 

In  this  section  we  discuss  finite  sample  size  distributions  of 

0  and  'll .  These  are  obviously  needed  when  n  and  ;n  are  not  large. 

They  would  also  be  very  useful  in  seeing  how  large  n  and  m  will 

need  to  be  in  order  to  use  the  asymptotic  results. 

First,  under  Hq:  F  *  G  we  know  that  each  possible  ordering  of 

the  {X1>  and  {Y^}  in  the  combined  sample  is  equally  likely.  One 

may  thus  enumerate  all  possible  rankings  and  record  8,  it/,  D(u) ,  and 

D(u)  for  each  ordering.  This  yields  the  complete  distributions 

under  H  . 
o 

Under  Ha:  8  ^  0  or  i|j  0  the  rankings  are  not  equally 

likely  and  the  problem  is  more  complex.  We  must:  (a)  find  it, 

(b)  simulate  it,  or  (c)  approximate  it.  This  is  a  topic  for 


further  research. 
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Another  source  for  the  finite  sample  size  distributions  of  9 
and  il>  arises  from  the  fact  that  they  are  often  simply  linear 
transformations (which  are  monotonic)  of  classical  linear  rank 
statistics  which  often  already  have  finite  sample  size  tables 
available.  For  our  use  we  merely  take  the  appropriate  linear 
transformation  of  the  tabled  critical  values  using  the  following 
theorem. 

Theorem  2.7:  If  9 (or  is  a  linear  transformation  of 
N 

T  ■  [  c , a (R  )  and  finite  sample  tables  of  percentiles  for  T  are  availa- 

i=l  1  1  . 

ble,  then  these  tables  easily  yield  percentile  tables  for  0(or  ip)  . 

Proof :  Table  l(p.  9)  gives  values  of  a  and  b  for  various  linear  rank 
test  statistics  T  which  have  tables  available,  such  that  a  9  +  b  **  T. 
Clearly, 

A 

a  -  P(T  >  t  )  -  P(a0  +  b  >  t  ) 

—  a  —  a 

-  P(9  >  a_1(ta  -  b))  . 

If  a  +  b  ■  T,  we  have 

a  -  P(T  >  t  )  *  P(<|>  >  a_1  ( t  -  b))  . 

—  a  —  a 

Theorem  2.8:  For  f  symmetric,  an  approximate  a  level  finite 

sample  size  test  for  the  simultaneous  Hq:  0  »  «  0  is  given 

from  a  size  a,  test  of  H  :  9  -  0  and  size  a,  test  of  H  :  to  »  0 
1  o  2  o 

where  ■  1  -  A -a 


Proof:  Under  H  ,  P(rej.H  :  9  -  0)  -  1,  -  P(re1,H_:  y  •  0)-  a,  • 

1  '  0  0  10  L 

1  -  /I  -  a  .  Than, 

P(rej.  9  ■  0  or  raj.  $  -  0)  •  2(1  -  *'l  -a  ) 

-  P(raj.  H  :  9-0  and  raj.  H  :  hi  -  0)  . 

J  o  J  o 

Since  f  Is  svmmatric,  9  and  are  asymptotically  independent,  so 

o' 

_ _  2 

P(rej.  Hq:  9-0  and  rej.  Hq:  'i>  -  0)  -  (1  -  *1  -  a) 

Then, 

P(rej .  Hq:  9  -  l>  -  0)  -  2  -  2  A  -  a  -1  +  2  /l  -  a  -  (1  -  a) 

"  x  , 

as  desired. 

These  methods  for  tescing  Hq:  F-G  are  based  upon  linear  rank 

statistics,  as  the  test  statistics  are  functions  of  linear  rank 

statistics  with  score  functions  determined  by  the  assumed  model  f  . 

o 

In  order  to  model  the  data  and  to  obtain  more  accurate  and  interpretable 

tests  and  estimators,  we  will  develop  methods  to  detannine  which  of 

several  f  ’s  best  fit  the  data.  We  begin  this  development  in  the 
o 

next  section. 
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3 .  MODEL  SELECTION 

In  this  section  we  begin  developing  criteria  to  select  a  model 

for  D(u).  For  example,  in  ordinary  regression  analysis  one  often 

2 

makes  a  choice  among  models  based  on  R  or  predictability  of  the 
dependent  variable  or  interpretability  of  the  coefficients  of  the 
independent  variables.  Such  criteria  may  be  developed  for  the 
approach  we  take  in  the  two  sample  problem. as  given  below,  e.g.  Theorem 
3.2.  In  particular,  we  will  develop  criteria  for  determining  whether 
fQ  models  the  data  adequately  or  whether  f  and  a  location  parameter  or 
scale  parameter  difference  adequately  models  the  data,  D(u) .  In  this 
case,  the  difference  between  the  predicted  and  the  observed  values  is 
D(u)  -  D(u) ,  a  stochastic  process  for  u  e (0,1) .  We  state  the 
asymptotic  distribution  theory  for  D(u)  -  D(u)  in  section  3.1  and 
suggest  some  measures  of  fit  for  the  various  f  densities  in  the 

O' 

results  of  section  3.2.  It  is  the  measure  of  distance  between  D(u) 

and  D(u)  that  will  allow  one  to  choose  f  which  best  models  the 

o 

data. 


3.1  The  Asymptotic  Distribution  of  D(u)  -  D(u)  under  Hq 
We  develop  this  distribution  as  follows: 

(1)  From  Pyke  and  Shorack  (1968)  and  our  Theorem  2.1  we  have, 

under  H  , 
o 

t  1_X  u 

[D(u)  -  u]  k  (-y-^K  B (u)  . 

o 


1 
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(2)  Using  section  2.3,  Theorem  2.5,  we  will  have,  under  H  , 

o 

/N  [D(u)  -  u]  ■+  Z1  (u) , 
where  Z^(u)  is  a  zero  mean  normal  process 

(3)  Than,  under  H  , 

o 

»f[D(u)-  u]  -  '/N  [D(u)  -  uf-  /N  [D(u)  -  D(u)  ]  , 

and  we  will  show,  under  H  , 

o 

1-X  . 

»i[D(u)-D(u)]  *  Z(u)  -  Z1(u)-(~Y-a)^  B(u)  , 

o 

where  Z(u)  is  a  known  0  mean  normal  process  given  fQ. 

A  • 

One  way  to  characterize  Z(u)  is  directly  from  D(u)  and  D(u) . 

That  is,  D(u)  is  a  functional  of  D(u)  and  we  know  the  asymptotic  dis¬ 
tribution  of  D(u) .  Perhaps  a  more  elegant  way  to  study  Z(u)  is  to  use 
the  Brownian  bridge  representation  of  «^i[D(u)-u]  and  /n[D(u)-u] 
by  studying  Z^u)  -  c  B(u).  These  arguments  are  illustrated  in 
the  following  theorem,  for  F-G  and  fQ  symmetric. 

Theorem  3.1;  Under  the  conditions  of  theorem  2.5,  we  have 

[D(u)  -  D(u) ]  -  Z(u)  , 

where  Z(u)  is  a  0  mean  normal  process  with  covariance  kernel  for 

0  <  u,  <_  u,,  <  1 
-L  4m 


' 


V 
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1-A  f  Q  (u  )f  0  (u,) 

„  /  /  0\  f  /  i  \  OOiOOfc 

W-2  ■  <T-)Iui(1-“2>  -  7TTT— ; - 

o  /_  J  (u)du 

U  O 


Q.Cu,  ) 

olooloZooz 
/01[l-Qo(u)Jo(u) ]“du 


]  • 


1-X 


o  h 

Proof:  Let  c  =  (— r — )  .  Then,  as  in  Theorem  2.5, 


^[D(u)-  u]  =  /01[foQo(u)W1(y)+go(u)foQo(u)W2(y)]d[/N(D(y)-y)] 


and, 


[D (u)-  u]  -  /0  Iu(y  <_  u)d[^N(D(y)-y)  ]  , 

where 

I(y  <_  u)  »  1  ,  y  £  u  , 

-  0  ,  y  >  u  . 

Subtracting  (3.2)  from  (3.1),  we  have 

>/n[D(u)-D(u)  f  ^(u)W1(y)+f2(u)W2(y)-Iu(y_<u)  ]d[/N(D(y)-y)  ] 

^cJq^ [ f l(u)Wl(y)+f 2 (y)^2  (y)-Iu(yfu)  ]dB(y) 
=c[f1(u)/0Hj1(y)dB(y)  +  f2(u)/01W2(y)dB(y)-B(u)]  . 
Thus,  the  asymptotic  mean  of  ^[DCu)  -  D(u)]  is 
E{Z(u)}-cf1(u)E[/01W1  (y)dB(y)  ] 


+  cf2(u)  E[/01W2(y)dB(y))  -  c  E[B(u)] 


0  , 


(3.1) 

(3.2) 


since 


<  i  , 


n*W^(v)dv  »  /  *  'i*\(.v)dv  =  0  .  I.ettinR 


we  have 


11 -> 


co  v  |  /.  (  )  ,  Z  (  u  ,  )  !  -  c~  Kf  (  I  j  ( u  j  ' K  ^  (  v  )  d B  ( v  ) 

+  f  ,(u1)/r)1W,(v)dB(v>-P(u1H 

'!fi(u2)-'"olwi(v)dB(v) 

+  f2(u2)/olw,(y)dB(v)-B(u2) ]} 

“  c“{f1(u1)f1(u2)E[/01'-1(y)dB(y))2 

-  E[B(u2)/01Wi(v)dB(y)]f1(u1) 

+  f2  (u1)f2(u2)E[/o1W2(y)dB(y)  ]2 

-  E[B(u2)/01W2(y)dB(y) ]f2 (u^) 

-  f1(u2)E[B(u1)/01Wl(y)dB(y)] 

-  f2(u2)E[B(u1)/Q1W2 (y)dB(y) l+u^d-i^) } 


=  c"{  u1(l-u2)+ 


fi(ul)  f1(u2)  f2(ul)f2(u2) 


fnlJ  2 (u)du  / _ 1 [ 1 -Q  (u)J  (u)]2du 
U  O  U  O  O 


W  f 1 <u2 )  f2^ul^f2^u2^ 

/  2 (u) du  /  ^[l-Q  (u)J  (u)]2du 

U  O  IJ  o  o 

fl(u2)fl(ul)  _  f2(u2)f2(ul}  } 

/01,ot(ll)du  /01U-Qo(u)Jo(u)]2du 


I 


c_{u, (1-u,)- 


1 1  (ul)  f  1  )  f2(u1)f,  (u,) 

/oljo’(u)du  Jq]U-Q0(u)Jo(u)  ]'du 


VU  1*U2J* 


since  f, (u)  *  f  Q  (u)  and  f„(u)  -  Q  (u)f  Q  (u) • 

1  0  0  i.  0  0  0 

Corollary  3.1:  For  Che  assumptions  of  Theorem  3.1,  u-(u.,  ,u,  , . . .  .iu.)  ' 
D(u)“  [D(u^)  , . . .  .DCu,^)  ]  ,  and  D(u)«[D(u^)  .,D(u^)  ]  ,  we  have 

[D(u)  -  D(u) ]  °  ^  (0,  :k)  , 

where  Z,^  -  (o^),  cr_  -  K^Cu^.u^),  and  Mk  denotes  the  multivariate 
normal  distribution. 

,  » 

Mow,  for  u  *  1  u,  ,u^ , . . .  ,u,^:  we  use 


'N  [D(u)  -  D(u)]  -  Nk(0,  Zk) , 


where  Zk  •  (j^)  as  ab°ve  under  the  assumptions  that  iQ  is  symmetric 

and  3  and  v  are  "small".  Mote  that  Z,  depends  on  the  underlying  f  . 

<C  3 

“  •  _  i 

In  essence,  by  studying  D(u^)  -  Ofu^)  for  u^  ■ 

i  *  1,  . ...  M,  we  study  the  residuals  of  a  regression  model.  One 

may  further  study  the  application  of  classical  methods  for  analysis 

of  residuals  in  regression  a nalysis  to  these  residuals. 

One  mav  then  determine  the  quantiles,  u^,  where  D(u)  fits 

the  data  well  and  where  it  fits  the  data  badly.  The  main  use  for 

the  distribution  of  0(u)  -  D(u)  is  as  an  indicator  of  how  well 

an  underlying  f  density  will  model  the  data,  D(u) .  Ve  give  some 


v 


methods  for  determining  the  fit  of  D(ij)  to  D(u)  in  the  next  section. 


3.2  Some  Measures  of  Fit 


From  the  distribution  theory  for  D(ii)  -  D(u)  we  may  devise 
test  statistics,  whose  distributions  will  provide  a  test  of  the 
location  scale  model  with  f  specified  as  the  underlying  family. 
Computing  these  values  for  several  hypothesized  f  families  will 
allow  us  to  choose  the  most  appropriate  f  for  modelling  the  data 
as  a  local  location  and  scale  difference.  This  also  may  indicate  a 
location  scale  model  is  not  viable  or  that  9  and  are  too  large, 
if  we  determine  D(u)  does  not  fit  D(u)  within  chosen  limits. 


We  may  accept  D(u)  as  an  adequate  description  of  the  data 
provided  it  matches  D(u)  at  {u^,  i  =  1,  . . . ,  k)  where  the  u^ 
are  fixed  at  some  particular  percentiles  of  interest,  in  the  sense 
of  Eubank  (1979).  Another  set  of  u^  of  interest  may  be  the  data 


points,  i.e. ,  u. 


N+l 


;  i  =  1,  ....  N. 


Theorem  3,2:  If  the  conditions  of  Theorem  3.1  hold,  then  a  measure 
of  the  fit  of  D(u)  to  D(u)  is 


<5D2(u)  =  NY[D(a)-D(u)]'  l  [D(u)-D(u)  ]  2  x2  (k) 

k 

2 

where  X  (k)  denotes  the  central  chi-square  distribution  with 
k  d.f. 


Proof :  Assuming  Theorem  3.1  results,  this  is  a  standard 


application  of  the  distribution  of  quadratic  forms  of  normally 
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distributed  vectors.  The  degrees  of  freedom  depends  on  the  number 

of  u ^  chosen,  i.e.  {u^;  i  =  1,  ....  k}  . 

We  may  also  choose  which  of  the  f  seem  to  model  the  data  well 

o 

by  calculating  Fisher's  extension  of  Mahalanobis'  distance  as 
defined  in  Kshirsagar  (1972).  That  is,  we  use 

S^Cu)  ={Ny[D(u)-D(u)  ]  [D(u)-D(u)  ] )  1 

as  a  standardized  distance  measure  of  D(u)-D(u)  for  each  f  . 

- -  o 

We  choose  the  f  which  yields  the  smallest  6„(u)  as  that  f  which  hest 
o  D  —  o 

models  D(u)  by  D(u) .  Eubank  (1979)  gives  some  indication  of 

optimal  u  values  to  choose  for  each  f  . 

i  o 
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4.  DATA  ANALYTIC  COMP'1  FiSONS  WITH  OTHER  APPROACHES 

In  this  section,  we  analyze  three  data  sets  from  the  literature. 
The  kneecap  data  in  section  4.1  will  illustrate  what  information  our 
methods  provide  when  we  fail  to  reject  H^:  F=G.  The  rat  data  in 
section  4.2  illustrates  a  rejection  of  due  mainly  to  the  location 
difference  and  provides  some  interesting  comparisons  with  other 
methods.  The  coronary  heart  disease  data  sets  in  section  4.3  also 
illustrate  rejection  of  Hq,  but  the  fit  of  the  model  suggests 
further  analysis.  We  only  analyze  the  marginal  distributions  of 
the  two  bivariate  components  of  these  coronary  heart  disease  data. 

We  would  like  to  thank  David  Scott  for  his  kindness  in  sending  us 
a  listing  of  his  unpublished  coronary  heart  disease  data  for 
analysis  and  comparison. 

4.1  The  Kneecap  Data  in  Switzer  (1976) 

Switzer  (1976)  analyzes  two  sample  data  with  his  techniques. 

The  data  set  is  given  in  his  Table  1  as  right  kneecap  congruence 
angles  in  degrees  for  40  male  subjects  and  40  female  subjects.  We 
know  the  data  was  supplied  by  R.  G.  Miller,  but  do  not  know  the 
questions  it  was  gathered  to  answer.  Consequently,  any 
analysis  is  limited. 

Switzer's  analysis  gives  94.5%  confidence  bands  on  t  »G  ^F. 

o 

The  figure  (Switzer's  Figure  1)  appears  linear,  where  the  bands 
are  not  infinite,  and  suggests  a  location  scale  model  for  the 
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differences  between  male  and  female  right  congruence  kneecap  angles. 

Wilkand  Onanadesikan  ( 1968) have  named  a  plot  of  q  versus  G-1  [F(q)] 

a  Q-Q  plot  and  pointed  out  that  linearity  means  Y  is  a  location 

scale  transform  of  X  in  distribution.  Our  9  and  t/>  will  estimate 

this  relationship.  Switzer  also  gives  94.5%  confidence  sets  for 

max  [t  (q)-q],  min  [t  (q>  — q ] ,  and  (-5+10)  1/  ^  (t  (u)-u]du 
-10,-5  °  -10,-5  °  “10  ° 

as  [-5,12],  [-7,8],  and  [-5.9,10.1].  He  points  out  that  these  were 
obtained  for  a  limited  range  and  for  Smirnov's  confidence  procedure 
only.  Otherwise  finding  such  sets  is  much  more  difficult.  Switzer 
estimates  9  in 

(a)  t  (uO  =  (j  +  9  (b)  t  (co)  =  2id  +  9 

o  o 

for  three  different  confidence  procedures  to  obtain  the  results  below. 


Procedure 

Median 

Quantile 

Smirnov 

(a) 

-5,7 

-5,11 

-2,8 

(b) 

4,16 

5,21 

12,14 

These  models,  (a)  and  (b),  are  special  cases  of  a  general  location 
scale  difference  between  X  and  Y.  He  regards  the  short  confidence 
interval  for  9  in  (b) ,  from  a  Smirnov  based  procedure,  as  in¬ 
dicative  of  2  being  a  bad  value  of  the  slope  to  fit  these  data  and 
suggests  fitting  general  parameters. 

Switzer  then  parametrizes  t  (w)  as  follows: 

o 

(a)  t  (w)  =  (1  +  X)  u  +  9 

o 

(b)  t^(w)  *  «  +  0/(1  +  Aw),  A, 9  0  . 


In  (a)  Che  creatmenC  effect  increases  with  <o  and  in  (b)  Che  treatment 


effect  decreases  as  u  increases.  Swili;?r  then  reports  joint  con¬ 
fidence  intervals  for  0  and  X  in  models  as  given  in  his  Figure  2. 

In  the  Parzen  approach  we  hypothesize  a  general  location  scale 

U  -U  0  _  — 0 

1  1  2  1 

model  and  allow  0  *  — -  and  i p  ■  — — -  to  be  positive  or  negative 

obtaining  qualitatively  similar  models  to  (a)  and  (b) .  Since  we 
assume  a  location  and  scale  model  for  G  and  F,  we  have 

t  (ui)  -  G~1F(oj) 
o 

■1*2  +  02 


iD-y 

-  u ,  +  o  0  [F  <— i)] 
2  2  0  0  0^ 

u*-U1 

*  u2  +  a2  (“) 


where 


Ui  °2 

y,  -  —  +  —  <*» 
2  °1  °1 


y, - +  (1  +  m  , 

2  °l 


°2-°l 


So,  Switzer's  0  in  model  (a)  is  y„  -  —  and  Switzer's  X  is  our  4>, 

2  °1 

except  that  our  can  be  negative  also.  We  add  the  assumption  that 


the  distributions  of  Y  and  X  are  of  the  same  family,  F  . 


I 
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Our  analysis  of  Switzer's  kneecap  data  yields  the  following 
results.  A  comparison  of  the  quantile  functions,  and  Q^,  in 
Figure  A  suggests  >  and  °2  K  °i  or  9  >  0.  ♦  <  0. 

Three  choices  of  f  yield  the  following  estimates. 


A. 

p  values 

f 

0 

9 

/V(0) 

* 

W(ip) 

for  H  :  9=^=0 
o 

Normal 

.206 

.233 

-.134 

.158 

.46 

Logistic 

.400 

.387 

-.214 

.187 

.31 

Cauchy 

.145 

.316 

-.100 

.141 

.70 

regardless  of 

f  we 
o 

fail  to 

reject 

H  and 
o 

further  remark  0 

and  ip  are  all  within  two  standard  deviations  of  zero. 

A  quick  comparison  of  D(u)  -  D(u)  graphs  in  Figure  B 
gives  an  indication  that  the  logistic  density  may  fit  the  data 
best  with  D  rising  faster  than  D  in  all  three  cases.  This  indicates 

I  I 

F  /G  *  f/g  >  1  at  those  quantiles.  The  tests  pnd  estimates  of 
section  3  have  not  been  implemented  in  the  computer  program  yet. 

4.2  Doksura  and  Sievers  (1976)  Rat  Data 

These  authors  have  also  developed  techniques  for  estimating  a 
general  function  t(x)  where  F(x)  =  G[  x  +  t(x)],  or  t(x)  =  G  ^F(x)-x. 
In  fact,  Doksum  (1974)  has  developed  the  asymptotic  distribution  of 
t(x)  =  G  ^F(x)  -x.  The  questions  of  interest  in  their  paper  for  the 


»•  \ 

i-  ,  \ 

*  V 


two  sample  problem  are 


SWITZER"  KNEECAP  OPTO  (FIRST  =FEUOIeI.  SECOND-HOLE 
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(i)  Is  the  treatment  beneficial  for  all  the  members  of  the 
population,  i.e.,  is  t(x)  >  0  for  all  x? 

(ii)  If  not,  for  which  part  of  the  population  is  the  treatment 
beneficial,  i.e.,  what  is  (x:  t(x)  >  0}  ? 

(iii)  Does  a  shift  model  hold,  i.e.,  is  t(x)  *  9,  for  some  0 
and  all  x? 

(iv)  If  not,  does  a  shift-scale  model  hold,  i.e.,  is 
t(x)  *  a  +  6x,  for  some  a  and  8  and  for  all  x  ? 

All  these  are  answered  by  giving  a  confidence  band  ,  [t#(x) ,t*(x) ] , 

for  t(x)  simultaneously  for  all  x. 

Doksum  and  Sievers  develop  "nonparametric"  confidence  bands 
by  inverting  a  distribution  free  Kolmogorov-Smirnov  test  statistic 
for  Hq:  F*G .  This  is  their  S-band.  They  give  an  approximate 
weighted  band  (W-band)  based  on 


Wn  ,  J*  aup  ilixhW 


<HH(x)  } 


u 

where  ( t )  -  (t(l-t)K  .  They  remark  that  this  i|/  maximizes  the 
minimum  power  against  H^:  F-G  >_  5  for  some  6  >  0.  They  give  a 
third  nonparametric  confidence  band  (R-band)  based  on 


F(x)-G(x) I 
H(x) 

the  Renyi  statistic.  The  authors  present  some  tables  showing  why 
they  prefer  the  W-band  to  the  R-band  or  S-band  except  when  small 
quantiles  are  of  interest.  Finally,  when  one  is  given  a  location 


*N 


m  sup 


The  Parzen  approach  suggests  differences  in  scale  and  location 
by  observation  of  the  two  groups  quantile  box  plots (Figure  C)  when  the 
suspected  outlier  is  included,  and  a  lesser  difference  in  location 


i 


with  no  difference  in  scale  when  the  outlier  is  deleted. 
Graphical  comparisons  of  the  quantile  functions  for  each  group 
suggests  a  lowering  of  location  from  ^22.5  to  ^11  with  exaggerated 
loss  at  lower  quantiles  and  much  less  exaggerated  gain  at  upper 
quantiles . 


Order 


Statistics 

Xr 

XQ 

X,  , 

X,  „ 

x.„ 

x„„ 

5 

8 

15 

17 

19 

22 

Qy  15.4 

17.7 

21.4 

26 

26.6 

27.4 

38.4 

QX  ~12'9 

-9 

6.6 

15.5 

17.9 

28.2 

54.6 

a  28.3 

26.7 

14.8 

10.5 

8.7 

-.8 

-16.2 

This  leads  us  to  remark  that  t  actually  levels  off  at  less  than  20 
at  the  upper  quantiles  of  x  while  it  goes  much  below  -20  at  the 
lower  quantiles  of  x  until  the  supposed  outlier  is  encountered. 
With  this  reinterpretation  of  their  t  one  would  agree  that  the  two 
approaches  seem  quite  consistent,  although  we  do  not  report  a 


confidence  band  for  A,.,  in  this  table,  see  Theorem  6.2, 


Continuing  with  the  Parzen  approach,  we  obtain  estimates  of 


u2_l‘l  V°l 

0_  -  ancj  p  = - for  the  following  f  families  with  the 


1  “1 
’outlier"  included: 


- - 

— — 

P  Values 

f 

0 

9 

/V(0) 

for  H  :  0= 
o 

Normal 

-  .687 

.298 

.396 

.211 

.012 

Logis  t  ic 

-1.531 

.517 

.503 

.249 

.002 

Cauchy 

- 1 . 780 

.422 

.  18  5 

.189 

.00008  . 

*  o 


All  -  are  within  2- '  of  3  except  the  logistic  which  is  borderline, 

and  all  6  are  outside  2c ~  of  0.  So,  the  two  samples  differ 

significantly  in  location  and  perhaps  in  scale  if  we  assume  the 

logistic  f  . 

o 

Further  analysis  was  done  omitting  the  "outlier"  with  similar 
but  slightly  more  revealing  results.  The  quantile  functions  again 
suggest  more  extreme  shifts  in  the  lower  quantiles. 


u 

.1 

.3 

.5 

.7 

.3 

.9 

Q  Control 

15.8 

20.5 

23.5 

26.2 

27.4 

29 . 2 

Q  Treatment 

-14.3 

0. 1 

11.1 

15.6 

19.4 

36.3 

-30-1  - 

■  20.4 

-12.4 

-10.6 

-3.0 

7 . 6 

so  remark  that 

if  che 

-16.9 

of  Che 

control 

group 

was 

outlier  then  perhaps  the  54.6  of  the  ozone  group  is  an  "outlier" 

also.  One  might  conjecture  by  throwing  out  more  of  the  tail  behavior 

in  these  data  chat  the  normal  D  would  be  the  best  fitting  D(u) 

model.  Examining  the  D(u>D(u)  graphs  (Figures  D/E)  ,  seems  to  indicate  the 

Cauchy  f  does  well  with  the  outlier  in  and  the  logistic  f  does 
o  o 

well  with  the  outlier  left  out.  We  would  racher  accept  the  extreme 
behavior  of  these  rats  weight  gains  unless  some  explanation  could  be 
given  as  to  the  cause  of  an  error  in  Che  measurements  resulting  in  an 
outlier.  We  also  report  the  estimates  of  9  and  q  with  the  "outlier" 
left  out.  Then,  (X,-^)^  -  (11.01  -  24. 19) /6. 68  -  -1.97  and 


(S2-51)/S1 


(19.32  -  6 , 63) /6 . 68  -  1.35 


?  values 


‘o 

5 

3 

* 

for  H  :  c=t 
0 

Normal 

-  .341 

.30 

.  556 

.213 

.0007 

Logistic 

-1.73 

.52 

.699 

.25 

.00009 

Cauchy 

-1.75 

.  -o 

.  734 

.19 

.0001 

Mow,  without  the  one  data  point  all  9  are  bevond  2c~  0f  0  and  onlv  the 
Cauchy  -j  is  within  2r.'  .  Either  the  data  are  distributed  Cauchv  ar.d 
the  samples  differ  in  location  or  the  data  differ  in  location  and 
scale  and  are  distributed  logistic  or  normal  or  some  unknown  other 
possibility.  In  either  case  it  appears  the  location  difference  is 
dominant,  since  9 ;  >  f  p  j  . 

We  also  note  that  deleting  the  one  possible  outlier  did  not 
afreet  rfre  -  _  or  ~  ]  very  much,  buc  did  affect  the  logiscic 
and  normal  d  and  i.  This  points  out  the  robustness  of  the  Cauchy 
model. 

4.3  Coronary  Heart  Disease  Data  in  Scott,  et  al.  (1978) 

David  Scott  of  Rice  University  presented  a  seminar  at  Texas 
A&M  where  he  analyzed,  for  two  groups  of  patients,  measures  of 
plasma  trygiycerides  and  cholesterol.  The  aim  of  our  analysis  is  to 
examine  their  relation  to  coronary  heart  disease.  In  the  control 
group  we  have  m  =  51  patie-ts  with  no  history  of  coronary  heart 
disease  and  in  the  treatment  group  we  have  n  »  320  patients  with  a 
history  of  coronary  heart  disease  (C.H.D.).  The  question  is  "How  do 
the  rwo  grouos  differ  in  trygiycerides  and  cholesterol  levels?" 

\ 


Scott's  analysis  estimated  the  bivariate  density  functions  of  each 
group  and  graphically  compared  them.  Although  there  is  a  Parzen 
bivariate  quantile  approach  in  the  making,  we  only  analyze  the 
marginal  quantile  functions  of  the  two  groups  at  this  time. 

First,  we  examine  the  tryglycerides.  Both  groups  have  similarly 
shaped  quantile  functions  (Figure  7)  indicating  the  distributions 
may  be  skewed  to  the  right.  The  C.H.D.  group's  tryglycerides  tend 
to  be  higher  but  also  spread  over  the  non  C.H.D.  group's 
tryglycerides  for  approximately  the  lower  quart ile. 


On  examining  9  and  <l>  we  see  the  predominant  difference  is 
clearly  a  shir t  in  location  rather  than  scale. 


f  6 

0 1  'll 

^  ' 

p  values 
for  H  : 

0 

""  n 

0 

Normal  .441 

.15  .C06 

.11 

.014 

Logistic  .797 

.26  -.004 

.13 

.009 

Cauchy  .431 

.21  -.013 

.09 

.013 

Here  D(u)  for  the  logistic  seems  to  match 

D(u> 

Che  best  (Figure  G 

The  descriptive  A 

again  suggests  a 

skewed 

distribution  for 

since  A  (u)  increases  with  u  and  "■>  '  0. 

u  .125  .25  .5 

.75 

.375 

Q  for  non  CHD 

32  91  120 

160 

195 

Q  for  CHD 

91  115  150 

213 

284 

?  24  30 

58 

39 

Since  the  quantile  functions  give  an  indication  of  skewness,  there 


should  be  an 


o 


which  would  give  more  efficient  estimates  of  9  and  y. 
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We  now  examine  the  marginal  distributions  of  cholesterol  levels 


for  each  group. 

The  quantile 

functions 

(Figure  H) 

are  again  similar 

in  shape  but  the 

C.H.D.  group 

does  have 

a  longer  tail  suggested  and 

shifted  higher  suggesting  3  > 

0. 

p  values 

C 

3  o  j 

■V 

o ' 

for  H  :  0-v*O 

‘  0 

V 

0 

Mormal 

.51  .15 

-.03 

.11 

.003 

Logistic 

.87  .26 

-.05 

.13 

.OOA 

Cauchv 

.44  .21 

-.007 

.10 

.117 

We  conclude  cholesterol  levels  differ  in  location  only,  regardless 

of  which  of  the  three  f  are  assumed.  For  this  variable  A  is 

o  Q 

much  more  consistent. 


u 

.  i 

O 

.  3 

T 

.3 

.9 

0  C.H.D. 

168 

130 

191 

236 

243 

267 

non  C.H.D. 

L50.5 

161 

169.5 

208.5 

->->2 

239 

17.5 

19 

21.5 

27.5 

26 

28 

Q 


We  also  remark  that  the  graphs  of  D(u) -D(u) (Figure  I)  suggest  the 
logistic  fQ  may  model  these  data  well. 


A. A  Remarks  on  Examples 

From  these  three  example  data  sets  we  see  that  the  quantile 
approach  agrees  with  results  of  ocher  authors'  nonparametric  techniques. 
We  agree  in  accepting  H  as  in  Switzer's  data  (section  A.l).  We  also 
agree  in  rejecting  as  in  Doksun  and  Sievers’  data 


FIGURE  H.  Quantile  Functions  for  Cholesterol  Data 
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(section  4.2).  Also  in  section  4.2,  Parzen's  technique  seems  to 

indicate  the  difference  of  the  two  groups  is  best  explained  by 

both  location  and  scale  shifts  rather  than  just  a  location  shift 

as  it  seems  Doksum  and  Sievers  may  have  believed  when  the  supposed 

outlier  is  thrown  out.  When  the  data  point  is  kept  in  we  best 

fit  the  differences  with  the  Cauchv  f  and  a  location  shift 

o 

similar  to  Doksum  and  S ievers  results .  At  any  rate  the  two  approaches 

partially  agree  and  partially  disagree.  The  Parzen  approach  also 

models  Che  data  and  gives  some  alternate  explanation  of  what  may  be 

going  on  in  these  two  groups  of  data.  This  points  out  how  crucial 

the  assumption  of  that  data  point  being  an  outlier  can  be.  And 

finally,  in  section  4.3  (David  Scott's  data)  we  see  we  can 

reject  H  with  various  null  families  quite  consistently,  yet  be 
o 

led  through  the  graphical  techniques  to  alternate  explanations 

beyond  the  analysis  performed.  That  is,  we  are  led  to  consider  some 

f  for  our  D  estimator  which  are  skewed.  Although  our  approach 
o 

may  new  test  H^:  0i,J/»0orFi,G  and  estimate  0  and  ip  for  several 

£  ,  some  skewed  or  short  tailed  densities  would  also  be  of  interest 
o 

in  modelling  some  data. 


\ 

v  V 

»  V 
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5.  OVERVIEW  OF  THE  LITERATURE  ON  NONPARAMETRIC 
ESTIMATION  AND  TESTING  OF  LOCATION  AND 
SCALE  PARAMETERS 

The  nonparametr ic  estimation  of  location  parameters  was  started 
by  Hodges  and  Lehmann  (1963).  Sen  (1966)  has  extended  this  technique 
to  scale  parameters. 

In  the  decade  of  the  1970's  researchers  developed  simultaneous 
estimates  of  both  location  and  scale  parameters .  This  section  re¬ 
views  the  relation  of  some  widely  used  location  and  scale  tests 
with  estimators  in  the  location  and  scale  model  for  D(u) . 

5.1  Location  Tests 

5.1.1  Linear  Rank  Tests  and  9 

Linear  rank  statistics  are  of  the  form 
N 

S  =  l  a(I,U  , 
i=l 

where  a  is  an  arbitrary  function  of  i  and  ,  is  a  relative  rank 
of  the  X  sample.  S  is  a  simple  linear  rank  statistic  if 

S  -  j  Cj  . 

i=l 

Many  of  the  statistics  for  the  two  sample  problem  that  have  been 
developed  are  simple  linear  rank  statistics. 


(i)  The  Van  der  Waerden  test  statistic  is 


m  R . 

T1  •  Jo  W  ■  -  *9- 

where  J  (u)  =  $  \u)  and  0  is  based  on  the  N(0,1)  f  . 
o  o 

This  was  developed  by  Van  der  Waerden  (1952)  and  is  asymptotically 
equivalent  to  the  Fisher-Yates-Terry-Hoef fding  normal  scores  test 
where  Jq(u^)  is  replaced  by  E[Jo(u^)],  Hajek  and  Sidak  (1967). 


(ii)  The  Wilcoxon  test  statistic  [Wilcoxon  (1945),  Mann  and 
Whitney  (1947)]  is 


T2  * 


I 

i=l 


2m  - 


m(N+l) 

6 


for  9  based  on  the  logistic  f  and  Q  given  in  Parzen  (1979)  . 

o  o 


(iii)  The  median  test  developed  by  Mood  (1950) ,  Westenberg 
(1948) ,  or  Mathisen  (1943)  is 

m 

T..  =  j  signfR.  -  j(N+l) ]  =  m  6, 

J  i=l 

for  9  based  on  the  double  exponential  f^  and  Q  . 


5.1.2  Exceedance  Tests  for  Location 

These  tests  obtain  their  name  from  the  fact  that  they  are 
based  on  the  count  of  one  sample's  points  which  are  either  above  or 
below  the  other  sample's  maximum  or  minimum  value. 

They  are  rather  special  tests  not  ordinarily  used  in  a  standard 
analysis.  The  following  are  taken  from  Hajek  and  Sid£k  (1967): 
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(i)  The  Haga  (1959)  test  with  work  by  Sidak  and  Vondracek 
(1957)  is  based  on  four  quantities:  A  =  If  of  X,  >  max  Y  , 

1  j 

A'  =  if  of  Yj  >  max  B'  =  If  of  <  min  Y.  ,  and  B  =  #  of  Y  <  min 
X^  (i=l,  m;  j=l,  n) .  Then  the  test  statistic 

Tl  =  A  +  B  -  A'  -  B' 

is  optimal  under  special  conditions  for  the  uniform  F  where  there 

o 

is  neither  an  optimal  rank  test  defined  or  a  9  test  unless  we  consider 

using  <  f  ,  f  ,  0  <  p  <  q  <  1.  However,  the  four  quantities 

p,q 

A,  A',  B  and  B'  are  related  to  various  comparison  functions  or  D 
and  we  mention  the  exceedance  tests  to  show  how  they  may  fit  into 
the  general  approach  taken  here.  When  D(u)  does  not  fit  D(u)  well 
we  may  use  the  Haga  test  as  it  is  related  to  various  0(u) .  For 
D^(u)  =  FG  1(u)  (  proportion  of  X's  jc  Y)  we  know  A  =  irfsize  of 

last  jump  in  D^(u)];  B'  =  m[size  of  first  jump  ’in  D^(u)]. 

Similarly,  define  A'  and  B  for  D2(u)  =  GF  (u) .  Thus,  the  Haga 
test  is  related  to  first  and  last  jump  sizes  in  the  two  comparison 
functions  D^(u)  =  FG  1(u)  and  D2(u)  =  GF_1(u). 

(ii)  Rosenbaum's  (1954)  test  is  a  simpler  test  designed  for 
the  alternative  that  Y  is  shifted  to  the  right,  9  >  0.  In  our 
notation  this  test  statistic  is  m[l-D(u  for  max  Y^)]  and  is  more 
easily  adapted  to  the  D(u)  used  here. 


V 

k 


1 


5.1.3  Goodness  of  Fit  Tests  for  Location 

These  tests  are  based  on  some  measure  of  the  distance  between 
F(x)  and  G(x) .  We  present  them  here  to  show  their  relation  to 
D(u),  and  thus  provide  a  wider  statistical  base  for  the  importance 
of  D(u)  and  D(u) . 

(i)  The  unnormalized  Kolmogorov-Smirnov  test  statistic  is 

T  =  max  |F(x)  -  G(x) \  .  Kolmogorov  (1933)  developed  this  for  a  one- 
x 

sample  test  and  Smirnov  (1939)  for  the  two  sample  test.  For 
D(u)  =  FH  L(u) ,  we  have 

T  =  sup  | D(u)  -  u | 

Durbin  (1973)  gives  a  derivation  of  the  distribution  of  sup|B(u)|  which 
may  be  used  for  studying  the  distribution  of  T.  Graphically  we  plot 
D(u)  -  u  versus  u  and  see  if  it  significantly  exceeds  0  in  absolute 
value  which  is  determined  by  a  given  critical  value  from  the  dis¬ 
tribution  of  sup  | B(u) |  . 


(ii)  The  Renyi  test  is  also  related  to  the  comparison  function 
and  weighted  by  H(x) .  It  is 

Ta  -  max  <n+m)jF(x).  '^(x)  j 
A  mF(x)  +  n  G(x) 

where  A  =  {x  :  (n+m)  ^[mF(x)  +  n  G(x)  ]  =  H(x)  >  a}.  Therefore, 
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(iii)  The  Cramer  Von  Mises  test  is  related  as  follows 


T  =  (n+m)  X  ( 1-A)  f°  [G(x)-F(x)  ]  2d  G-(~  -] 

rrrrn 


=  (n-t-m)  1  ( 1-1 ) [D (u) -u]  ^  du 

0 

where  u  =  H(x) . 


(iv)  Finally,  we  also  may  remark  that  Weiss  (1076)  gives  an 

analogy  which  shows  a  two-sample  test  of  H  :  D(u)  -  u  =  0  can  be 

o 

developed  from  anv  one  sample  goodness  of  fit  test.  Also,  Pettitt 
(1976)  gives  a  two  sample  Anderson-Darling  statistic 


-.2  _  .  n+m-1  [ M .  (n+m)  -n . 

.2  nm  f30  (F-G)  ...  .  1  r  l  l 

A  =  — —  j  — - —  dH  =  —  / 

nm  n+ra  _oo  H(1_H)  mn  .  =  1 


i ( n+m- i ) 


where  M.  =  nD  Similar  to  2.1.3  (iii)  we  obtain 

l  n+m 


.2  fl  lD(u)-u]‘  , 

A  =  nm  j  — r- - p—  du  , 

nm  1  0  u(l-u) 


where  u  =  H (x) . 


The  point  we  can  make  with  these  goodness  of  fit  tests  is  that 
they  are  all  functions  of  D(u)  -  u.  They  can  be  computed  from  the 
comparison  functions  and  all  measure  the  "size"  of  D(u)  -  u. 
Parzen's  D(u) ,  as  well  as  its  extensions,  attempts  to  model  D(u) 
and  we  will  want  to  minimize  the  "distance"  between  D(u)  and  D(u) . 
In  other  words,  we  want  D(u)  to  converge  to  the  truth  so  that  our 
estimators,  9  and  ijj,  are  consistent.  Other  location  tests  and 
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estimates  are  given  in  Table  3  (p.  29). 


5.2  Scale  Tests 

5.2.1  Linear  Rank  tests  and  i 

These  tests  are  of  the  same  form  as  in  section  5.1,  l.e., 

N  N 

S  =  >  a  (i,  8,,.)  or  /  c.  a(R.,.).  However,  the  score  functions 

Si  l  N  l 

1=1  1=1 

a(i,R.,.)or  a(R  )  are  different  in  that  uhev  are  devised  to  detect 
’  d  i  N  i 

differences  in  the  dispersion  or  scale  parameter  of  the  two  dis¬ 
tribution  functions  F  and  G. 

(i)  The  Klotz  (1962)  test  is 


h  ■  .1  tVi&'i2 '  <-■  y  • 

i=l 

for  f  ,  the  standard  normal,  where  J  (u)  =  t>  ^(u).  Hajek  and 

o  o 

Sidak  remark  that  tlie  Klotz  test  is  asvnptotically  equivalent  to 

Ri  2 

the  Capon  test  where  (J  ]  Is  replaced  by  its  expected  value. 

Our  estimators  6  and  '1  are  linear  transformations  of  the  Van  der 
Waerden  and  Klotz  tests  respectively  when  D(u)  uses  the  normal 
density  for  f  .  Each  is  an  asympotot ical Iv  optimal  test  for  f 
the  standard  normal  density  function. 


(ii)  The  Ansar i-Bradley  test  is 


m  1  1 
T  =  l  [y(nr+n+l)-lR.  -  ifm+n+1) 

1  i=l  “  1  ‘ 
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-  f  m+i)  -  -  ^j) 

=  |  (N+l)  -  (J-3)  , 

where  is  calculated  from  Parzen's  D(u)  with  fQ(x)  =  ^-(l+|x|)  , 

the  density  for  which  the  Ansar i-Br ad  ley  test  is  also  optimal. 

We  note  that  the  Ansar i-Bradley  test  is  formed  in  a  manner 
similar  to  the  Wilcoxon  test  for  location,  but  with  ranks 
modified  to  detect  the  scale  difference.  Sukhatme  (1957)  has 
also  introduced  a  modified  Wilcoxon  test  for  scale  differences. 


The  Siegel-Tukey  test  is  similar  but  allows  use  of  the  Wilcoxon 

tables  for  small  samples.  Though  the  Ansari-Bradley  and  Wilcoxon 

tests  are  formed  similarly  they  are  optimal  for  different  densities. 

Through  this  implementation  of  Parzen's  (1980)  approach  we 

obtain  both  location  and  scale  tests  for  each  density,  as  well  as, 

the  estimates  (these  were  given  in  Table  1  (p-  9)  for  completeness). 

For  example,  a  location  difference  test  and  estimate  for  f(x)  * 

1  -2 

T  (1+M)  can  be  obtained  from 


R. 


0 


m  £  3ign(,  -  min  (M+1 »  1  ~ 


i=l 


N+l’ 


The  quartile  test  for  differences  in  scale  seems  to  be  another 
example  of  a  nonparametric  test  for  one  parametric  difference 
which  has  had  no  corresponding  test  for  location  difference 
advocated  which  assumes  the  same  density.  In  Table  3  (p.  29)  we  give  the 
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test  statistics  which  we  advocate  for  this  f  . 

o 

(iii)  The  Quartile  test  developed  by  Westenberg  (1948)  is 


_  1  c  ,  N+l |  N+l,  ,, 

t3  =  j  l  lsign(|Ri j-\ +  H 

i*  1 

=  If  of  x  obs.  i(H_1(.25),  H_1  ( .75) ) 
-  m{  1  -  D( .  75)  -  DC. 25)  1  . 


It  is  related  to  the  comparison  function  that  we  use  here  and 
a  D(u)  model  can  be  obtained  from  the  density  for  which  it  is 
asymptotically  optimal,  i.e. 


f(x)  -  1  »  | x |  <r  , 


*  - 2  »  1*1  >  4  • 

16x 

Using  this  density  for  fQ  in  Parzen's  D(u)  model  we  obtain 

;  -  i- i  i  1 

2  ”  R^^CN+l),  |(N+1)] 

for  an  estimate  of  the  scale  differences  of  the  two  samples. 

The  location  difference  estimate  obtained  simultaneously  was  given 
in  Table  3  (p .  29)  . 

(iv)  The  Savage  test  (l.R.  Savage  (1956)]  is  asymptotically 


optimal  for  the  exponential  density  and  is  defined  as 
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The  exponencial  density-quantile  function,  f q [ Qq (u) ]  is  not 

in  the  RKHS  used  to  obtain  $  and  0  on  the  whole  interval  [0,1]. 

However,  if  we  truncate  the  interval  to  {^,  1  -  4]  we  may  still  use 

all  of  the  data  to  obtain  an  estimate  of  D(u).  However,  we  need 

an  algorithm  to  compute  6  and  $  for  whatever  the  sample  sizes 

are  using  the  1  as  8iven  1°  Theorem  2.6. 

N’  1  “  N 

This  algorithm  may  also  be  used  to  truncate  left  and/or  right 
portions  of  the  combined  sample.  The  result  is  quite  different 
from  the  Savage  test  and  is  discussed  in  section  2.4. 

(v)  in  Table  3  (p.  29)  we  also  gave  the  scale  tests  developed  from 
Parzen's  D(u)  model  which  correspond  to  the  logistic,  double  ex¬ 
ponential,  and  Cauchy  families  for  fQ.  Those  functions  provide 
formulas  for  testing  equality  of  scale,  and  thus  extend  the  set 
of  nonparametric  tests  at  our  disposal  by  combining  a  location 
and  scale  test  optimal  for  the  same  density. 

5.2.2  Exceedance  Tests  for  Scale 

A  variation  of  the  location  Haga  test  is  due  to  Kamat  (1956) 
and  haa  test  statistic  T-A+B'  -  A'  -  B  where  these  components 
are  defined  as  in  the  Haga  test  (see  5.1.2  (i))..  We  remark  that 
this  exceedance  test  is  also  a  function  of  jump  sizes  in 
Dj^(u)  •  FG  *(u)  and  Dj(u)  ■  CF~*(u).  Simpler  versions  of  the 
Kamat  test  are  given  by  Rosenbaum  (1953)  and  Klotz  (1962).  We 
mention  this  to  give  an  indication  of  the  work  done  relating  to 
comparison  functions  other  than  D(u)  ■  FH  *(u).  These  tests 
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provide  comparisons  for  further  research  for  Parzen's  (1980) 

D^(u)  and  D2<u)  techniques. 

5.2.3  Goodness  of  Fit  Tests  for  Scale 

Hajek  and  Sidak  (1967)  remark  (p.  99)  that  one  can  make  the 

Kolmogorov- Smirnov,  Renyi,  and  Cramer  Von  Mises  tests  more 

sensitive  to  differences  in  scale  by  successively  subtracting 

smallest  and  largest  pairs  of  CD  [see  Hajek  and  Sidak  (1967)] 

rather  than  subtracting  C_  ,  ...,  C  Successively.  So,  one 

1  k 

can  compute  a  goodness  of  fit  test  in  two  ways,  one  sensitive  to 
location  differences  and  one  sensitive  to  scale  differences. 

Again,  we  see  the  attraction  and  need  for  simultaneously  estimating 
location  and  scale  differences  for  a  given  problem,  Hq:  F  *  G. 

We  emphasize  that  Parzen  (1980)  has  both  a  location,  9,  and 

•A  A  ^ 

scale,  <l>,  component  in  the  D(u)  estimator  of  D(u)  *  F[H  (u)], 
the  comparison  distribution  function,  which  are  asymptotically 
optimal  for  the  same  fQ.  In  the  following  section  (5.3)  we  remark 
on  some  relationships  of  Parzen's  (1980)  methods  with  various 
robust,  adaptive*  and  combinations  of  other  techniques. 

5.3  Remarks  on  Some  Other  Approaches  and  Extensions 

5.3.1  Combinations  of  Separate  Tests  for  Location  and  Scale 

Duran,  Tsai,  and  Lewis  (1976)  have  combined  tests  of  location 
and  scale  to  also  simultaneously  test  for  equality  of  both 
parameters.  They  use  Randles  and  Hogg's  (1971)  result,  which  states 

I 


« 
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that  under  H  ,  even  translation  invariant  statistics  (Mood,  Klotz, 
o 

and  Ansar i-Bradley)  are  independent  of  odd  translation  invariant 
statistics  (Wilcoxon  and  Normal  scores) .  Then  using  Chernoff  and 
Savage  (1958)  they  obtain  the  asymptotic  normality  of  the  test 
statistics  under  Hq  and,  with  more  conditions,  an  asymptotic  bi¬ 
variate  normality  result  under  certain  alternatives.  Their 
alternatives  are  similar  to  Parzen’s  D(u)  model  where  we  assume 
6  and  i|>  small. 

They  gave  no  examples,  but  we  still  may  make  some  comparisons. 
More  research  is  needed  for  their  techniques  to  be  evaluated  on 
examples  and  they  did  not  provide  any  methods  for  estimating  the 
location  or  scale  differences.  Parzen's  approach  naturally  leads 
one  to  simultaneous  location  and  scale  tests  for  the  same  underlying 
density  which  is  not  the  case  with  the  even  and  odd  statistics.  For 
example,  combinations  could  be  the  Wilcoxon  an<f  Ansar i-Bradley 
(different  fQ)  or  Quartile  and  Median  (different  fQ)  or  Normal 
scores  and  Klotz  (fQ  =  normal)  tests.  The  analogous  result  from 
Parzen’s  D(u)  model  is  that  it  is  asymptotically  optimal  for  one 
fQ  (examples  in  Table  1,  p.  9).  For  local  alternatives  there  is 
a  simultaneous  test  for  location  and  scale.  From  Corollary  2.2, 


it  is 


‘■"'(iNi) 
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2 

and  is  approximately  a  x  (2);  but,  6  and  p  also  estimate  the 

differences  in  parameters  of  the  two  samples  [Parzen  (1980)]. 

•>  « 

In  section  3  we  use  D(u)  -  D(u)  to  help  choose  f  correctly.  Further, 
section  6.3  methods  will  estimate  the  differences  between  samples 
at  any  percentile  or  quantile  as  well.  Lepage  (1975,  1976) 
also  gives  many  results  on  the  distributions  and  efficiencies  of 
this  method  of  combining  tests.  Other  authors  have  tried  to  form 
tests  which  are  insensitive  to  differences  in  one  of  the  parameters 
while  they  are  sensitive  to  differences  in  the  other. 


5.3.2  Robust  and  Similar  Tests 

ai 

The  classical  F-test  for  H  :  —  "1  has  been  found  to  be  non- 

o  o2 

robust  to  deviations  from  normality  with  respect  to  size  by  many 
authors.  Shorack  (1969)  examines  an  approximate  permutation  test, 
a  "jacknife"  procedure,  and  some  "rank  like"  and  other  tests  for 
Hq:  —  ■  1  by  considering  their  Pitman  asymptotic  relative  ef¬ 
ficiency  and  Monte  Carlo  studies  of  power.  Shorack's  simulation 
included  the  uniform,  normal,  and  double  exponential  densities. 

The  rank  like  tests  do  not  use  all  the  scale  properties  of  a 
continuous  variable  but  have  other  desirable  properties  as  given 
in  Moses  (1963).  The  permutation  test  (APF-test)  is  an  approxi¬ 
mation  based  on  an  F  statistic  and  a  minor  variation  (for 
locations  unknown)  of  an  approach  used  by  Box  and  Anderson  (1955). 
It  has  the  same  asymptotic  relative  efficiency  as  the  F  test  but  a 
very  different  robustness  level  as  seen  In  the  Monte  Carlo  results. 


J 


V* 
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Shorack  also  invents  Chls  test  to  obtain  a  confidence  interval  for 

°1  2 

(~)  .  Another  quite  practical  test  which  did  not  do  badlv 

2 

in  the  simulation  for  m  -  n  was  Levene's  (1960)  test.  Although 

assumptions  are  violated,  Levene  suggested  doing  an  ANOVA  on  the 

means  of  { (X^  -  X)2}  and  {(Y^  -  Y)^}.  The  jacknife  like  procedure 

also  requires  m  *  n  and  jacknifes  the  logarithms  of  sample 

variances.  It  was  on  a  par  with  the  APF  test  in  terms  of 

2 

power.  The  rank  like  tests  were  also  an  ANOVA  of  log  where 
is  a  scale  parameter  estimator  for  a  subgroup  of  all  n  or  m 
observations.  The  APF  test  was  reported  to  do  quite  well  in 
comparison  to  the  others.  The  approximation  of  the  APF  test 
statistic's  distribution  makes  it  attractive  for  small  samples, 
but  it  seems  quite  cumbersome  especially  in  confidence  interval 
calculation.  It  does  not  help  one  choose  fQ  nor  does  it  decide 
anything  about  the  location  parameter  differences. 

Some  of  the  earliest  attempts  to  deal  with  the  two  sample 
problem  were  also  approximations.  Murphy  (1976)  compares  the 
t-test,  Aspin-Welsh  approximate  t-test,  and  Wilcoxon  test  by 
simulation.  The  Aspin-Welsh  approximate  t  statistic  is 


where 


1  .  c2  (1-c)2 

df  (m-1)  (n-1) 


and 


given  by  Welsh  (1937)  with  critical  values  by  Aspin  (1949).  The 

three  tests  were  compared  for  normal,  uniform,  and  exponential 

f  densities.  It  is  interesting  that  Murphy  concludes  the 

Aspin-Welsh  test  is  highly  satisfactory  for  Hq: 

when  ^  o^  while  the  Wilcoxon-Mann-Whitney  is  not.  Murphy 

also  pointed  out  that  no  test  was  satisfactory  when  skewness  was 

present.  We  may  choose  f  and  calculate  our  results  even  for 

f  skewed.  When  one  assumes  f  normal  and  o,  +  cr  testing 

Hq:  ■  u2  is  known  as  the  Behrens-Fisher  problem.  Sheffe 

(1970)  presents  practical  solutions  to  the  problem.  It 

appears  Behrens  (1929)  and  then  Fisher  (1935a)  began  this  expansion 

of  the  two  sample  location  problem  by  considering  o^  f  o^.  This 

was  Sir  Ronald  Fisher's  (1935a)  controversial  paper,  "The  Fiducial 

Argument  in  Statistical  Inference."  He  also  proposed  nothing 

less  than  a  randomization  test,  also  in  1935,  in  his  book.  The 

Design  of  Experiments.  Many  authors  have  studied  the  robustness 

of  the  t-test  with  respect  to  a.  Posten  (1978)  did  an  extensive 

simulation  study.  He  did  his  study  of  the  t-test  over  87 

Pearson  curve  distributions  where  the  level  of  the  test  was  estimated 

from  100,000  generated  t-values,  except  for  one  case  (n*30  had 

"only"  83,000).  Posten  varied  n  from  5  to  30,  from  0  to  2, 

and  $2  f ro®  1.4  to  7.8.  Posten  points  out  the  obvious  conclusions  from 
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In  fact,  all  tabulated  significance  levels  round  to  .04,  .05,  or 
.06  through  the  whole  simulation  study  which  had  nominal  level  a  •  .05. 
Other  authors  [e.g.  Pearson  (1931),  Geary  (1947),  Finch  (1950),  Gayen 
(1950)  and  Box  (1953)}  have  shown  that  this  is  not  the  case  with  the 
commonly  used  and  taught  F-test  for  variance. 

Since  the  t-test  is  very  robust  with  respect  to  a,  we  should 
choose  a  linear  rank  test  based  on  power  or  other  considerations, 
not  the  accuracy  of  significance  levels.  In  this  regard,  Fligner 
and  Killeen  (1976)  have  introduced  analogues  of  the  Ansari- 
Bradley,  Mood,  and  Klotz  tests  which  have  the  same  Pitman  ef¬ 
ficiency,  but  significantly  higher  powers  for  small  samples. 

They  respectively  are 

-1  m 

T,  -  [m(n+m)]  \  R  , 

i=l  1 


2—1  11  2 

T»  ■  (m(nrHi)  ]  £  R  ,  and 

i»l  1 


m 


t3--'11I1  ♦'1'i+kSIT>2  • 

where  is  the  rank  of  V  ■  |x^  -  m|  among  the  combined  sample 
of  V^'s  and  =  |  Yj  -  mj  where  m  is  the  median  of  the  combined 
sample  of  {X^}  and  {Y^}.  These  tests  may  be  chosen  on  the  basis  of 
small  sample  power,  where  we  would  choose  Parzen's  (1980)  tests 

A 

relating  to  D(u)  based  on  simultaneous  estimation  of  0  and  <fr  or 

graphical  and  statistical  help  in  choosing  f  .  Perhaps  these 

o 

authors'  "score"  functions  can  be  interpreted  in  a  way  to  help 


s r  \ 


% 


98 


develop  more  snail  sample  estimators  for  the  Parzen  approach. 

Other  authors  have  conjectured,  if  one  can  not  reduce  the  influence 
of  nuisance  parameters  in  a  particular  test,  perhaps  one  can 
adapt  to  the  influence  of  the  nuisance  parameter  and  obtain  a 
more  powerful  test  of  chat  particular  parameter.  We,  in  fact, 
model  both  the  location  and  scale  differences  in  the  work  here. 

5.3.3  Adaptive  Type  Tests 

Sen  (1962)  and  Potthoff  (1963)  have  attempted  adapting 

rank  tests  for  location  to  adjust  for  unequal  variances  by  a 

conservative  approach.  However,  many  of  the  rank  tests  then 

became  dependent  on  f  .  Others  have  been  more  successful. 

o 

Hogg,  Fisher,  and  Randles  (1975)  have  designed  adaptive  location 
test  procedures  for  3kewed  distributions.  Very  few  nonparametric 
or  robust  procedures  consider  how  to  detect  or  what  to  do  when 
skewness  is  present.  Parzen  (1979,  1980)  has  also  given  techniques 
to  help  detect  bimodality,  as  well  as  skewness, by  using  an  auto¬ 
regressive  density  estimator.  Hogg  (1976)  also  remarks  on  a 
possible  adaptive  tvo  sample  scale  test  where  one  decides  to  use 
a  Kamat,  Klotz,  Ansari-Bradley ,  or  quartile  test  based  on  the 
combined  order  statistics. 

5.3.4  Other  Approaches  of  Interest 

Rorwar  and  Hollander  (1975)  have  given  an  empirical  Bayes 
estimator  for  F(x)  which  is  optimal  for  a  Ferguson  Dirichlet 
prior.  Perhaps  they  would  want  us  to  develop  a  Bayes  D(u)  and 


D(u) .  The  first  problem  is  to  determine  when  this  prior  is 
adequate. 

Censored  modifications  for  the  Kolmogorov-Smimov  test  have 

been  given  by  Tsao  (1954)  and  Ishii  (1958).  Mehrotra  and  Johnson 

(1976)  extend  results  in  Hajek  and  Sidak  for  asymptotically  most 

powerful  tests  in  the  two  sample  problem  to  apply  to  censored 

data,  i.e.,  the  first  r  observations.  As  mentioned  in  Parzen 

(1979,  1980),  one  can  truncate  the  reproducing  kernel  Hilbert 

space  estimates  by  using  <f^,f2>  where  0  <  p  <  q  <  1  or  use 

p.q 

an  inner  product  based  on  the  censored  observations. 

Other  directions  to  go  include  the  Wald  and  Wolf owitz  (1940) 
runs  test  and  any  relation  it  has  to  these  methods.  Also,  Sen 
(1963)  has  investigated  a  class  of  tests  based  on  linear  combin¬ 
ations  of  the  number  of  Y,  between  X.,.  and  X,.^.  which  can  be 

i  (i)  (i+l) 

related  to  the  spacings  of  the  jump  points  in  D(u) .  Eubank 
(1979)  provides  one  sample  optimal  spacings  which  can  be  generalized 
to  the  two  sample  problem. 

With  regard  to  estimating  scale  differences,  Bhattacharyya 
(1977)  has  given  techniques  based  on  Sen's  (1966)  modification  for 
scale  parameters  of  Hodges  and  Lehmann's  technique  for  estimating 
location  shift.  Bhattacharyya  provides  estimators  of  c^/c^ 
corresponding  to  the  Ansar i-Bradley,  Siegel-Tukey ,  and  a  modified 
Sukhatme  test.  Lambscher  and  Odeh  (1976)  have  also  proposed 
practical  methods  for  estimating  scale  parameters  from  a 


Sukhatme  test.  Duran  (1976)  gives  a  review  of  approximately  80 
references  on  tests  for  scale  with  many  comments  on  these  and 
other  tests.  One  is  the  Barton  and  David  (1958)  test,  not  covered 
here.  The  great  number  of  techniques  Duran  comments  on  makes  it 
impossible  to  be  very  detailed  for  any;  but,  he  gives  many  valuable 
comments  and  references  on  comparisons  of  these  tests  and  "minor" 
modifications  of  them.  Also,  Zuijlen  (1977)  extends  much  of  the 
rank  tests'  distribution  theory  to  tie  non  i.i.d.  case.  Other  papers 
of  particular  interest  deal  with  comparison  function  techniques. 

5.3.5  Comparison  Function  Techniques 

Wilk  and  Gnanadesikan  (1968)  stimulated  research  in  the  area 
of  probability  plotting  where  they  use  Q-Q  and  P-P  plots  to  compare 
data  sets.  A  Q-Q  plot  is  essentially  a  plot  of  the  Y  quantile 
function  versus  the  X  quantile  function  (see  seption  4.2),  the 
points  being  joined  for  a  common  u,  i.e.,  G  ^F(x)  versus  X  *  Q  (u) . 

A 

P-P  plots  are  a  plot  of  u^  =  O^versus  u^.  *  where  Qx  *  Q^ 
at  each  point,  Switzer  (1976)  and  Doksum  and  Sievers  (1976) 
extend  the  graphical  work  of  Wilk  and  Gnanadesikan  by  developing 
confidence  procedures  for  various  comparison  functions  used  in  the 
two  sample  problem.  They  estimate  a  general  treatment  function, 
t(x).  Since  they  include  the  data  sets  in  their  papers,  we  are  able 
to  compare  their  results  to  those  developed  here  (section  4). 

Steck,  Zimmer,  and  Williams  (1974)  have  also  developed 
confidence  bands  based  on  D£(u)  =  G[F  *(u)]  or  D^(u)  *  F[G  *(u)]. 
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Further  research  may  generalize  this  to  D(u)  *  F(H  ^(u)}  and 
provide  some  further  comparison  of  the  two.  Doksum  (1974)  has 
also  given  the  asymptotic  distribution  of  t'(x)  *  G  ^[F(x)]  -  x. 
Doksum  and  Sievers  (1976)  have  also  begun  developing  confidence 
bands  with  or  without  a  location  scale  model  being  assumed.  They 
also  invert  two  sample  statistics  for  their  bands.  Their  location 
scale  m  al  is 


°2  °2  CT2 
t'(x)  =  u2  +  ~  (x-U^-x  =  w2  -  —  Vj_  +  (—  -  Dx 


*  u2  -  (<!>- 1)  u.  +  <P  x. 

U2-Ul  °2-<Jl 

In  this  case,  simultaneously  estimating  9  =  -  and  i|j  *  — - — 

1  1 

is  not  done.  However,  they  have  given  a  likelihood  ratio  confidence 


band  for  fQ  normal  when  m  =  n.  Further  research  could  compare  this 
with  Parzen's  (1980)  techniques  for  estimating 


D’(u) 


d(u) 


=  f [G~I(u>  ] 
g[G_i(u)] 


the  likelihood  ratio  for  the  two  samples  which  does  not  require 
m  *  n.  Doksum  and  Sievers  show  asymptotic  equivalence  to  M.L.E. 
bands  and  remark  that  an  advantage  of  cheir's  is  that  it  can  be 
applied  to  censored  data.  Again,  Parzen's  (1980)  techniques  have 
a  potential  for  censored  data  analysis  which  can  be  further  ex¬ 
plored.  They  also  remark  that  some  of  their  numerical  results  show 
that  the  general  bands  are  quite  inefficient  if  the  correct  model 
is  normal.  This  gives  us  a  motivation  to  use  D(u)  in  helping 
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identify  the  correct  f  .  We  do  it  both  for  statistical  reasons  of 

o 

efficiency  and  scientific  reasons  of  identifying  a  correct  model. 
We  conclude  this  section  with  a  few  comments  on  some  of  the  vast 
amount  of  research  concerning  the  two  sample  problem. 

5.4  Remarks  on  the  Literature  Review 

The  Behrens-Fisher  problem  remains  open  45  years  after  the 
work  for  which  it  was  named  and  the  list  of  several  more  general 
approximate  solutions  grows.  The  approach  of  Par2en  (1980) 
implemented  here  has  many  of  the  aspects  of  several  of  the  other 
authors  through  the  decades.  Hopefully,  it  will  contribute  to  a 
unified  approach  by  consideration  of  D(u)  and  D(u)  which  nearly 
all  the  previous  techniques  are  related  to  in  some  way.  Thus  far, 
only  asymptotic  properties  of  D(u)  have  been  given;  however, 
since  0  and  <j>  directly  relate  to  linear  rank  tests,  they  provide 
an  easy  extension  to  calculation  of  simultaneous  estimates  of 
location  and  scale  differences  and  use  of  the  finite  sample  size 
linear  rank  tables.  This  helps  unify  the  techniques  of  sections 
5.1.1  and  5.2.1.  We  also  see  the  importance  of  D(u)  in  the 
exceedance  and  goodness  of  fit  tests.  The  relationship  of 
D(u)  to  D(u)  will  help  utilize  goodness  of  fit  tests  in  choosing 
the  correct  linear  rank  test.  By  matching  D(u)  to  D(u)  we  will 
not  just  adapt  the  scale  differences  and  estimate  the  location 
differences  or  vice-versa.  Rather,  we  will  simultaneously 
estimate  location  and  scale  differences  and  by  comparing  the 


i  - 


v 


V 


results  for  various  fQ's,  determine  which  family  should  be  assumed 
for  a  good  fit.  This  has  begun  to  be  explored  in  sections  3.1 
to  3.2. 

One  method  of  obtaining  a  robust  estimate  Is  to  trim  the  data. 
This  can  be  explored  by  considering  0  <  p  <  q  <  1  rather  than 
p  =  1  -  q  *  0. 

The  recent  research  on  many  different  comparison  function 
techniques  provides  techniques  to  compare  with  Par2en's  (1980) 
approach  using  the  comparison  function,  D(u)  =  F[H  ^(u)].  This 
is  begun  in  section  4. 

One  problem  with  the  two  sample  research  has  been  simultaneous 
testing  of  location  and  scale  differences.  This  approach  clearly 
will  provide  a  solution,  i.e.,  from  Corollary  2.2,  we  have 

2, 


My 


(|HC)E 


'(2)  , 


under  Hq:  F  *  G  where  0  is  the  location  component  and  0  is  the 
scale  component.  One  can  easily  see  which  difference  has  a 
greater  change  with  respect  to  at  least.  Another  problem, 
especially  with  nonparametric  or  distribution  free  tests,  has  been 
to  estimate  the  difference  once  one  has  been  detected.  Parzen's 

*  a 

(1980)  technique  provides  local  estimators  ,  0  and  of  the  differ¬ 
ence.  With  A.  we  estimate  u,  -  u,  and  o.  -  o,  under  H  . 

£12  1  a 

This  pair  exists  for  many  useful  densities  and  each  estimator  is 
asymptotically  optimal  for  a  common  density.  Another  problem 
which  the  adaptive  tests  are  designed  to  deal  with  is  to  first 
make  a  decision  about  the  type  of  f  and  then  an  independent  test 
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of  a  parameter  difference  using  Che  decision  about  fQ.  Parzen's 

approach  using  D(u)  and  D(u)  gives  the  asymptotic  distribution  of 

d  and  ^  given  fQ.  The  results  in  section  3  lead  to  a  minimum 

distance  choice  for  f  among  the  set  of  f  that  one  considers. 

o  o 

By  examining  the  6  and  for  each  f  we  gain  an  indie? tion  of 
the  importance  or  lack  of  importance  as  to  which  f  we  should 
assume.  By  examining  the  residuals,  D(u)  -  D(u),  we  use  both  the 
location  and  scale  parameters  to  decide  on  fQ.  However,  the 
estimators  of  0  and  are  functions  of  the  R^  so  a  topic  of  further 
research  could  be  to  try  to  obtain  an  independent  choice  of  fQ. 
Still  the  scientific  interpretations  may  lead  one  to  consider 
several  models  for  the  data  although  a  statistically  more  powerful 
test  may  exist  choosing  just  one  model.  In  fact,  the  adaptive 


answer  is  to  choose  fQ  based  on  D  and  then,  independently  estimate 
Aq.  We  may  do  this,  since  {R^}  are  independently  distributed  of 
{x(i)}. 


A  common  problem  that  robust  techniques  try  to  deal  with  is 

having  a  known  f  for  the  data  but  shifted  location  and/or  scale 

for  part  of  the  data.  As  mentioned  earlier,  using  0  <  p  <  q  <  1 

would  be  a  common  technique  for  dealing  with  this  problem.  There  is 

an  indication  the  truncation  does  well  with  the  Cauchy  f  in 

Rothenberg,  Fisher,  and  Tilanus  (1964),  This  can  be  further 

explored.  Also  one  may  attempt  to  model  skewness  with  appropriate 

f  in  the  D(u)  model  or  A.(u)  model, 
o  Q 


1.* 


05 


Scot:,  at.  al.  (1976)  have  praaancad  a  bivarlaca  ianaity 
aaclaacioa  cachniqua  which  alao  halpa  daal  with  biaodality.  Via 
alao  analyze  hla  data  aat  la  taction  4.3,  although  not  with  a 
bivarlaca  approach.  Praaantly  vary  f aw  approaehaa  actaapt  to 
estimate  akawnaaa  or  bimodality  dlffarancaa  of  two  samples. 

Soma  indication  of  hov  chia  might  ba  dona  with  a  quadratic  D(u) 
modal  is  given  in  aaccion  6.  For  othar  diractlona  of  reaaarch 
saa  Parzan  (1979,  1980)  or  saction  6. 


* 


v 
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6.  Si  IMF  \l.  IHRNATIVK  MODELS  FOR  !)(.i) 

In  our  approach,  just  as  In  ordinary  regression,  one  will  often 

consider  more  than  one  model  for  the  data. 

With  our  D(u)  model  we  assume  an  f  family  for  the  underlying 

density,  although  we  give  techniques  to  help  choose  it.  We  also 

assume  a  linear  Taylor  series  expansion  is  adequate  and  that  6  and 

•i>  are  small.  If  '  and  are  found  to  he  of  moderate  size  we  mav 

wish  to  improve  the  expansion  by  including  more  terms  as  in  Section  6.1. 

Rather  than  including  more  terms,  we  may  use  the  6^  (see  1.10) 

model  which  is  accurate  under  the  alternatives  0^0  and/or  ^  0. 

In  section  6.2  we  do  this, still  assuming  that  the  underlying 

family  is  the  same  for  both  the  and  Y  .  There  we  suggest  es- 

u2~ul 

timators  of  u,  -  u,  rather  than  0  «  -  and  a„  -  o,  rather  than 

2  1  o  2  1 

1  *  — -  .  If  we  had  no  idea  which  f  would  model  the  data,  we 

11  0 
may  wish  to  construct  a  model  which  converges  to  the  correct  fQ 

and  is.  In  a  sense,  f  free.  In  Che  past  the  convergence  has  been 

quite  slow  and  tests  have  been  less  powerful  than  those  which  assume 

f  known.  However,  in  section  6.3  we  suggest  methods  to  be  explored 
o 

which  make  still  fewer  assumptions  than  those  made  thus  far 
regarding  fQ. 


6.1  The  Quadratic  Model 

In  this  section  we  give  a  Quadratic  expansion  which  results  in 
an  alternate  model  of  D(u)  containing  some  of  the  quadratic  terms,  i.e. 

*-,!l 

those  of  the  quadratic  expansion  of  F  about  -  . 

o  a. 
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In  section  Z  we  represent  G(x)  in  terns  of  ?  as  follows  (when 
3  and  4  are  small) 


x-u,  x-u, 

G(x)  *  -  9  -  H-~)] 

X-U  ,  X-u . 

A  quadracic  Taylor  series  expansion  of  F  )  abouC  — j—  gives 

1  1 

x-u ,  x-u  x-u . 

G(x)  i  ?  <— -*>-f  (-—)[«■>«(— i)l 

0  3^  OU^  0  ^ 


,  .XU., 

+  %  f  ’  (-“ ^)C  e+*  (— -1)  3 2 

o  «T1  ^ 


Again,  letting  x  -  H  1(u)  -  F'^u)  •  Qq(u)  48  on  P*  1®*  *iv4s 

x-u 

q  (u)  i  — —  and 
o  J1 

G(H_1(u)]iFH"l(u)-f  Q  (u)[e-N.  Q  <m)i*f  0  (u)J  (u)[e2+2e»Q  (u) 

00  o  000  o 

+  Qo2 (u) ]  . 

Since  H(x)  «  XF(x)  +  (l-X)G(x),  we  have  for  x  -  H  ^(u) 

2 

HH"1(u)iXFH'1(u)+(l-X)fFH“L(u)-9f  Q  (u)-|«Q  (u)f  Q  (u)4rf  Q  (u)J  (u) 

00  o  00  *00  o 


2 

+  9«if  Q  (u)J  (u)0  (u)+rf  0  (u)J  (u)Q  ‘(u)]  , 


o  0 


-  0^0 


and,  since  D(u)  •  FH  (u)  ,  we  have 


u  *  D(u)-d-X)  [3f  0  (u)+*j;Q  (u)f  0  (u)+9’f  0  (u)J  (u) 

03  0  00  00  O 


+  y'f  Q  (u)J  (u)0  (u)+t'(  0  (u)J  (u)0  (u) ] 

00  0  0  00  o  o 


108 


2  2 

where  O'  -  -40  ,  y'  -  -464'.  and  4>‘  “  -*#  .  Finally, 

D(u)-u-(l-X)|8f1(u)+4'f2(u)+e'f3(u)47,f4(u)+4'’f5(u)], 

where  f . (u)  *  f  0  (u),f,(u)  -  Q  (u)f  0  (u),f  (u)  *  J  (u)f  Q  (u), 

1  oo  L  o  o  o  3  ooo 

f^(u>  *  Qo(u)JQ(u)foQo(u) ,  and  f,.(u)  »  Qo2(u)Jo(u)fQQo(u) . 

If  the  f^(u)  art  In  the  RKHS  of  B(u)  with  p  »  1  -  q  •  0,  then 

we  may  estimate  0,  4>,  0',  y',  and  4>'  provided  we  treat  6',  y', 

and  <j >'  as  free  parameters.  For  computational  convenience  we  would 

do  this  to  begin  with.  This  model  gives  us  a  5  x  5  matrix,  E^, 

rather  than  a  2  x  2  I  as  in  the  linear  expansion.  In  fact,  for  E^ 

and  in  this  quadratic  expansion  we  need  fifteen  and  five  Inner 

products  to  exist  respectively.  One  may  try  to  orthogonalize  E^ 

to  reduce  the  problem.  We  suspect  the  shapes  of  foQo(u)JQ(u)  •  Po(u) 
2 

and  f  Q  (u)J  (u)Q  (u)  may  be  useful  detectors  of  blmodality 
o  o  o  o 

and  skewness  respectively.  We  leave  this  for  further  research. 

6.2  The  Model 

As  shown  in  Theorem  1.1,  Parzen's  (1979)  model  for  Q(u)  in 
the  one  sample  and  scale  problem  gives  an  asymptotically  exact 
two  sample  model  for  Aq(u)  also.  In  this  section  we  further 
develop  this  model  by  suggesting  estimators  of  Uj-Uj •  °2“°1’  an<* 
Aq(u)  based  on  Parzen  (1961,  1967).  We  also  give  the  asymptotic 
distribution  of  these  estimators  and  some  remarks  on  their  use  in 
the  analysis  of  two  sample  data. 
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The  model  for  Aq(u)  suggested  by  Theorem  1.1  is 
f  Q  (u)A  (u)-A  f  Q  (u)+&  0  (u)f  Q  (u)  , 

Oo  Q  WOO  CO  oo 

with  the  following  estimator 

f  Q  (u)A  (u)"A  f  Q  (u)+A  Q  (u)f  0  (u) 
o  o  Q  v  o  o  o^o  o  o 

obtained  using  Parzen  (1979)  results. 

We  suggest  estimators  of  &}J  *  U2  -  W1  an<*  m  °2  ~  °1 
in  Theorem  6.1  and  give  their  asymptotic  distribution. 

Theorem  6.1:  If  tho  conditions  of  Theorem  1.1  hold  and  f  Q 

- -  - . .  o  i 

and  Q  (f  Q  )  are  memhei.3  of  .he  RKHS  of  B(u)  for  p  »  1  -  q  »  0, 
o  o  o 

then  as  N  -*■  »  such  th-sa  1  ■  z  *  (0  <  X  <  1) ,  we  have 

>  N  o  o 


n  »  <»  A  A  A  2  2  2  M  A 

where  A  -  p.-u,  A  -  c,-c, ,  c,  ■  i  o.  +(1-A  )o  ,  and  u.  and  c 
U  i  1  0  L  i  L  01  O  l  1  1 

are  as  given  in  Parzen  (1979). 

Proof :  From  Csorgo  and  Revesz  (1978),  Parzen  (1979),  and  Eubank.  (1979) 
we  obtain  for  i  *  1  or  2, where  n^  denotes  the  ith  sample  size  , 

^"i  fo^o*U^<Mu*~wi~0i^o*U^  ^  °i  » 
and  further  [using  Parzen  (1961,  1967)), 


V 
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S  N2  1<o).  ^  1  , 


Letting  A  *  u_  -  m,  and  A  ■  a,  -  o.,  we  obtain 

M*1  Oil 


X  8  »2  1(2>-  Cj2!'1  I 

o  a 


2  2  2  -1  ~1 
where  c 2  =  +  (l-*0)°2  »  since  N  *  n^  Ajj  -  i^d-Af.)  and  XN^Q 

as  N-*»,  and  linear  combinations  of  independently  distributed  random 

variables  converge  to  the  linear  combination  of  their  asymptotic  limits. 


Corollary  6.1:  If  the  conditions  of  Theorem  6.1  hold,  then 

•N 

A^  and  A^  are  given  by 


Z'1  & 2  ~  Z'1  "  S  *  £3  , 


where  2  is  given  in  section  2,  and  £2  are  given  in  Parzen  (1979) 


and  Eubank  (1979)  for  i  ■  1,  2  as 


<f  Q  ,(f  Q  )Q  > 
00  o  o  1 


^(f  Qn),(f  QjQf> 

000  001 


and  we  define  £3  M  i2  “  £1 


Proof:  By  definition  of  ^  and  A^, 


[■■]  ■ 


By  definition  of  matrix  operations,  l  ^l"1' 

since  £3*£2“£i‘  Then, by  definition  of  inner  products,  since 


11 


we  note  that 
r»  , 

"WWW* 

13  'VW  •  WVV’ 

_  *• 

Remarks:  We  call  lQ(u)  ■  Qy(u)  -  Qx(u)  L^e  raw  difference  °f 
quantile  functions  at  the  quantile  u  and  <ig(u)  *  Q^Cu)  -  Qx(u)  the 
estimated  difference  of  quantile  functions  at  the  quantile  u. 

These  names  are  suggestive  of  our  interpretation  of  Aq(u).  Note 
that  this  interpretation  and  model  of  Aq(u)  are  asymptotically 
exact  under  all  location  and  scale  alternatives  of  Hq:  F  -  G, 
i.e.,  3^0  and  /  0.  However,  since  involves  Che  scale 
parameters,  we  note  that  in  using  the  estimators  suggested  here, 
as  in  Parzen  (1979)  and  Eubank  (1979),  we  presently  need  to 
treat  c 2  as  a  free  oarameter.  The  implementation  and  adequacy 
of  che  treatment  and  model  is  a  oroblem  for  further  research.  We 
emphasize  that  &q(u)  may  be  estimated  independently  of  D(u) . 

Next  we  give  a  definition  and  the  asymptotic  distribution  of 
lq(u),  0  <  u  <  1.  Let  l3(u)  ■  /N  fQ0o(u) (i^(u)-  Aq(u)]  be  the 
standardized  lq(u)  so  that  ^^(u)  •  0  for  0  <  u  <  1. 

Theorem  6.2:  If  the  conditions  of  Theorem  6.1  hold  and  f.  is 

—  1  o 

•%  A 

symmetric,  for  {  ut  t(0,l);  i  -  1,  ....  k;  and  (u^) ,  As(u.,), 

■*  I  m 

...,A3(u^)j  ,  then  as  N  -  ®  such  that  '»*!*  '» <°  *  '  »•  « 


1  (u)  »  S.  (0,  c 


*  "  > 
:  k* 


\ 


i 


v 


have 
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Remarks :  For  this  A^(u)  estimator  we  may  directly  apply  the  results 

of  Eubank  (1979)  and  choose  (u^;  i  =  1,  ....  k  }  for  small  k  as 

he  suggests.  We  may  also  use  {u^;  i  =  1,  ....  N  }  where 

Q^(u)  or  Qx(u)  have  jump  points,  i.e.,  u^  corresponding  to  the  data 

points. 

A  model  for  A^(u)  provides  many  problems  for  further  research 
besides  those  mentioned  thus  far.  For  example,  as  Professor  W.  C. 
Parr  has  pointed  out  to  me,  if  a  plot  of  A^(u)  versus  Qq(u)  is 
linear,  then  F  and  G  are  location  and  scale  shifts  of  the  dis¬ 
tribution  corresponding  to  Qq(u).  Further,  the  intercept  of  the 
vertical  axis  is  u 2  -  and  the  slope  of  the  line  is  a 2  -  o ^ . 

Tests  and  estimates  based  on  this  fact  are  a  topic  of  further 

research.  We  also  leave  the  Brownian  bridge  representation  of 
•>  *  — 

Q(u),  Aq(u),  and  A^(u)  -  A^(u)  as  topics  for  further  research. 

We  remark  that  the  residuals,  Ap(u)  -  Aq(u),  once  their  distribution 
was  derived,  could  be  used  to  select  an  appropriate  f  to  model  the 
data  for  any  location  and  scale  alternative  hypothesis  of  Hq:  F  ■  G 
for  independent  samples . 

Finally,  although  we  do  not  address  the  k-sample  problem  in 
this  work  we  offer  Theorem  6.3  for  the  following  definition  of 
the  k-sample  problem  to  suggest  further  research. 

Suppose  we  have  k  >  2  independent  random  samples,  denoted  by 


(Xj i ;  j  *  1,  •«.,  k  ;  i  ■  1,  ...,  n^ } 


where  n.  is  the  sample  size  of  the  jth  random  sample  and  each  sample 

is  n.  realizations  of  the  j random  variable  X  .  Further, 

J  -’x-u . 

suppose  X.  has  distribution  function  F. (x)  =  F  ( - *4  where  F 

i  i  o  o  o 

J  J 

satisfies  the  conditions  of  Theorem  6.1.  This  is  essentially 

a  generalized  analysis  of  variance  problem  studied  by  White  (1981) 

*  v 

and  similar  to  a  problem  studied  by  Hajek.  and  Sidak  (1967), 
chapter  3,  section  4,  and  Sen  (1962),  but  more  general.  In  the 
following  theorem  we  may  study  a  general  contrast  of  the  location 
parameters  and  the  scale  parameters  simultaneously  for  the  k 
populations . 


Theo rem  6.3:  For  this  definition  of  the  k-sample  problem,  let 

k 

(a,;  j  *  1,  . . . ,  k  }  be  fixed  constants,  Z  *  \  a  Q  (u) , 

j  "  j  =  l  J  J 

k  k  k 

i  *  Y  a,p  ,  and  Z  =  J  a,o,.  Then  as  N  =  7  n,  -*■  00  such  that 

“  j.L  Jj  »  h”  j-l  i 

ni 

L,  *  -J-  -  X  ,  (0  <  X  <  1) ;  j  =  1,  .  . . ,  k,  we  have 

Nj  N  oj  Oj 

^  fQQ0(u)  (-tQ(u)-^iJ-'ta  Q0(u)]  +  c3  B(u) 


where  Z  =  j  a,u,  and  Z  =  7  a.  a.,  and  ».  and  ^  are  as  in  Parzen 

u  j-i  J  J  0  A J  3  ,  1  k 

(1979)  and  Eubank  (1979).  Finally,  c-  =  [  a  X  o  . 

j=l  J  J  J 


V 


Proof: 


k 

-  a  A  1  z  .  B  (u)  ■  c-  3(u)  since  all  the  3, (u)  are  in- 
jil  J  °J  J  J  i 

dependenc  3rownian  bridges.  This  gives  Che  first  result.  Similarly, 


k  u  -y  l  -l 

*  J- ^  <1-1 
3*1  i  i  -  a 


) 


gives 


,  r  2,  2.  -1  2  -1 

^  *»JaJ  >  :  ‘  • 


for  the  variance-covariance  matrix  needed.  Clearly,  the  asymptotic 
mean  is  zero. 


6.3  Raw  0  and  J  Pst ima tors 

Por  the  D(u)  model  suggested  by  Parzen  (1980)  we  have  im- 

u,-u.  °2_<J1 

plementad  estimators  of  9  «  — - -  and  \l>  *  ■—  -- —  given  f  .  Our 

°1  CT1  ° 

l^(u)  model  also  depends  on  fQ.  A  topic  of  further  res^fatfch  is  to 
develop  estimators  which  converge  to  f  and  Drovide  estimators  of  the 
location  and  scale  parameters.  In  this  section  we  suggest  an 
approach  to  this  topic. 

Suppose  we  accept  the  linear  approximation  in  D(u)  but  do  not 

have  a  viable  choice  for  f  .  We  may  Chen  consider  studying  the 

technique  proposed  in  this  section.  For  the  inner  products  in 

Z  ^  e  we  only  need  Q  and  J  .  Parzen  (1979)  gives  a  consistent 
o  o  , 

estimace  of  Qg  in  Q  and  Hajek  and  Sidak  (1967,  p.  260  equation  (7)) 
give  a  consistent  estimate  of  J,  denoted  J,  which  are  each  functions 
of  the  order  statistics. 


i 
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Then  Parzen's  (1980)  model  yields  (for  symmetric  f  ) 

o 

/01[-Jo(u) ld[D(u)-ul 

/01tJo(u)}2  du 

and 

*  /01fl-Qo(u)Jo(u)]d[D(n)-u) 

*  — ; - 5 -  , 

L fl-Q  (u)J  (u) J  du 
0  o  o 


which  are  solely  functions  of  Q  ,  J  ,  and  the  data,  D.  We  need 

o  o 

appropriate  definitions  of  J  and  Q  using  the  data  based  on  Q  and 
perhaps.  Further  research  may  explore  these  estimators  from 
Parzen  (1979)  and  Hajek  and  Sidak  (1967). 

One  of  the  difficulties  in  the  problem  would  be  to  combine  the 
two  samples' different  Q  and  J  to  obtain  the  0  and  estimators. 


\ 


V 
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7.  EVALUATION  OF  D(u)  THROUGH  SIMULATION  EXAMPLES 


As  in  any  regression  model,  our  regression  model  for  D(u) 
from  Parzen  (1980)  may  not  contain  the  correct  independent 
variables  and  error  term.  We  have  added  sections  2,  3,  4,  and  5 
to  Parzen's  arguments  for  using  the  D(u)  model  and  show  how  it 
provides  useful  information  whether  we  reject  Hq:  F  *  G  or  not. 

As  suggested  in  the  remarks  of  sections  4  and  5  and  in  the  con¬ 
fidence  regions  for  9  and/or  $  in  section  2,  we  desire  to  make 
inferences  using  8,  <p ,  and  D(u)  when  we  detect  that  0  ^  0  or 
ijj  4  0.  We  also  provide  the  A^(u)  model  which  we  know  may  be  used 
when  9  ^  0  or  iJj  t  0.  As  mentioned  in  sections  2.4  and  4.2,  we 
also  become  interested  in  trimming  our  estimates  of  9  and  4»  for 
some  particular  densities  or  in  the  presence  of  suspected  outliers. 
In  section  7.1  we  make  remarks  on  a  design  for  a  simulation  study  to 
evaluate  the  accuracy  of  the  D(u)  estimator.  In  section  7.2  we 
give  a  few  simulated  examples  with  "large"  0  and  ip  for  six  different 
densities  and  m  *  n  =  30. 


7.1  Remarks  on  Factors  of  Interest  in  a  Simulation  Study 

In  this  section  we  propose  that  a  simulation  study  of  D(u) 
and  Aq(u)  include: 
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(1)  investigation  of  the  effects  of  9,  tf>,  n,  and  m  on  the 

estimates  in  D(u)  and  A_(u)  for  various  f  ,  and 

Q  o 

(2)  investigation  of  the  potential  value  of  truncation  of  the 
estimators  inD(u)  and  Ap( u)  when  the  two  samples  have  some  con¬ 
taminated  observations. 

Each  of  the  factors  involved  in  a  design  should  be  at  several 


levels . 

We  propose  9,  ii ,  n,  m,  fQ,  and  contamination  as 

factors. 

Some  dependent  variables  of  interest  are: 

(1) 

D(ui> 

-  D(ui)  ♦  Aq (Uf)  -  Aq(u^)  , 

(2) 

D(Ui) 

-  0(1^)  ,  Ap (u^)  -  Ap(ui), 

(3) 

0-9 

,  A  -  A  , 

U  U 

(4) 

a 

V  -  V 

»  A  -  A  , 

0  0 

and  various  functions  of  these  quantities,  for  example,  mean 

square  error,  bias,  and  variance  of  the  estimates.  We  would  expect 

the  main  effect  of  each  factor  to  be  significant  in  predicting 

most  of  the  dependent  variables.  Also,  if  the  8  x  ^  interaction 

were  not  significant  in  its  effect  on  a  dependent  variable,  as  we 

hope  for  small  and  moderate  6  and  tp,  then  the  simultaneous  estimation  of 

location  and  scale  parameter  differences  will  pose  no  problem 

beyond  the  ordinary  estimation  problems  of  an  individual  location  or 

scale  difference  that  researchers  have  traditionally  dealt  with. 

The  implementation  of  this  simulation  study  is  a  topic  of  further 
research.  The  next  section  reports  on  six  simulated  examples. 


i 


'•  *  ’v  _ 
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7.2  Simulated  Examples 

While  in  section  4  we  compare  the  approach  here  with  analysis 

of  "live"  data  sets  from  other  research,  in  this  section  we  report 

on  a  few  simulated  examples  to  begin  to  explore  the  situations 

where  our  D(u)  model  will  obtain  reliable  results.  These  examples 

demonstrate  the  need  for  further  research  ard  understanding  of  tie 

techniques  developed  in  this  work. 

We  generated  six  data  sets  and  submitted  them  to  analysis.  All 

six  pairs  of  samples  used  m  *  n  =  30,  0  *  .5,  and  ^  *  .5.  One  pair 

was  generated  from  each  of  the  six  distributions  given  in  Tables  5a 

and  5b.  This  means  each  of  the  0  in  Table  5a  and  each  of  the  <|>  in 

Table  5b  are  estimating  the  true  value  of  .5.  The  N,  L,  C,  D.E., 

A.B.,  and  Q  denote  the  normal,  logistic,  Cauchy,  double  exponential, 

"Ansar i-Bradley",  and  "quartile"  densities  respectively.  The  by 

an  estimate  denotes  the  estimate  is  beyond  two  standard  deviations 

(under  H  :  F  =  G)  from  its  true  value  of  .5. 
o 

The  theoretical  error  rate  under  Hq:  0  »  i>  ■  0  is  approximately 
2 

.05  +  .05  -  (.05)  *  .0975  for  a  given  f  .  Since  we  have  no 

o 

replications  we  are  unable  to  draw  any  conclusions. 

It  is  pleasing  that  56  of  the  72  nonparametric  estimates  were 
within  two  standard  deviations  of  the  true  9  or  f  for  two  reasons, 
although  we  make  no  conclusions  regarding  the  results  without  replica¬ 
tions.  One  reason  is  that  five  of  the  six  columns  of  Table  5  have 

estimates  from  a  nonoptlmal  f  .  The  other  reason  is  that  and  o' 

o  v 

are  derived  under  Hq:  F  -  G,  rather  than  nonzero  values  of  0  and  ♦ 


\  - 
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Simulated  6  Examples 


Assumed 


True  f 
N 


.22 

.50 

.62 

.2] 

.31 

.87 

1.15 

.4; 

-.08 

.27 

.82 

.41 

0 

.27 

.53 

.4 

-.16* 

-.03* 

.23 

.3/ 

.01* 

.16* 

.08* 

5b.  Simulated  i|i  Examples 


Assumed 


True  f 
N 


*^(0)  and  <jj  *  •'V(i)/)  under  H 


as  in  these  examples.  Perhaps  some  questions  of  interest  suggested 

by  Tables  5a  and  5b  would  involve  testing  location  parameters  with 

f  the  "quartile"  density  and  testing  scale  parameters  with  f  the 
°  o 

Cauchy  density. 

A  definite  topic  of  further  research  is  to  determine  what 
values  of  9  and  ifi  may  be  used  with  reliable  results  for  the  D(u) 


mode 1 . 
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8.  CONCLUDING  REMARKS 

Here  we  summarize  what  we  have  done,  what  we  have  not  done,  and 
make  suggestions  for  further  investigation  and  implementation.  We 
begin  with  the  mathematical  results. 

8.1  The  Mathematical  Problem 

With  Parzen's  (1961,  1979,  1980)  time  series  regression  models 
using  the  quantile  function  we  have  provided  new  theory  and  methods 
for  studying  how  two  samples  differ  in  location  and  scale  parameters 
and  at  all  quantiles.  These  methods  assume  continuous  increasing 
Fq  and  all  but  the  exponential  densities  were  symmetric  about  zero. 
Nevertheless,  the  body  of  simple  linear  rank  theory  and  methods  for 
location  and  scale  parameter  differences  has  been  expanded  and  made 
more  complete.  The  test  obtained  using  Parzen's  (1980)  D(u)  model  is 
a  simultaneous  location  and  scale  test  when  the  two  population  dis¬ 
tributions  are  a  location  and  scale  shift  of  a  common  distribution. 
These  tests  are  nonparametr ic ,  but  still  provide  estimators  of  tie 
location  differences  by  9  and  the  scale  differences  by  when  f  is 
the  correct  density.  We  also  give  computational  formulas  for  9  and 
•(/  simultaneously  or  individually  for  several  underlying  densities. 

By  examining  the  residuals,  D(u)  -  D(u) ,  for  a  finite  set  of 
u  values  we  are  given  some  guidance  in  selecting  the  underlying 
density  f  which  seems  to  model  the  data  better  than  others.  This 
also  provides  a  criteria  for  selecting  which  set  of  nonparametr ic 


V» 
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tests  and  estimators  to  use. 

There  is  always  the  possibility  that  D(u)  will  not  fit  D(u)  well. 

By  examining  D(u)  -  D(u)  at  different  values  of  u  we  may  see  which 

quantiles  contribute  more  to  the  deviation  of  D(u)  from  D(u) .  The 

willing  user  may  also  suggest  his  own  f  and  go  through  the  estimation 

and  testing  calculations  to  obtain  another  f  which  may  model  the  data 

better.  In  any  event,  our  significance  levels  are  correct  as  given  in 

Corollary  2.2  and  Theorem  2.7.  The  other  confusing  possibility  would 

be  that  several  f  would  fit  the  data  well.  In  this  case  we  would 
o 

need  to  check  if  all  8  and  1 1>  were  consistent  and  remark  that  a  larger 
sample  size  will  be  more  discriminating.  Eubank’s  (1979)  optimal 
uA  for  a  given  density  may  become  valuable  in  this  discrimination 
process.  Also  of  interest  here  are  the  alternate  models  of  D(u) . 

We  discuss  what  consequence  these  techniques  have  for  the 
scientist  who  analyzes  two  sample  data. 

8.2  The  Scientific  Problem 

Given  two  samples  from  an  experiment,  often  a  treatment  and 
control  group,  the  scientist  is  faced  with  determining  how  the 
two  samples  differ  and  trying  to  model  and  explain  that  dif¬ 
ference.  This  implementation  of  Parzen's  (1980)  models  provides 
a  general  location  and  scale  approach.  The  test  statistics 
and  estimators  for  parametric  differences  are  easily  cal¬ 
culated  frcm  the  ranks  of  the  X  observations  in  the  combined 
sample  of  X's  and  Y's  and  fQ.  Where  an  existing  simple  linear  rank 


*-  \ 

**  . 
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statistic  is  a  linear  transform  of  0  or  there  are  finite 
sample  size  tables  which  may  be  used  if  n  and  m  are  not  large. 

There  are  also  several  graphical  comparisons  of  the  two  samples 
provided.  The  slope  of  D(u)  provides  a  likelihood  ratio  type  com¬ 
parison  function  of  the  two  samples  at  each  quantile.  We  also 
provide  a  graphical  comparison  of  the  differences  of  the  two 
samples  at  each  quantile,  l.e.,  Q  -  Q  . 

X  A 

Besides  the  graphical  comparisons  for  one  may  develop 

tests  of  H  :  F  =  G  versus  a  difference  in  location  and  scale, 
o 

Also  provided  are  the  tests  and  estimates  of  location  and  scale 

VUl  °2~°1 

difference  by  considering  0  -  -  and  di  =  -  .  In  addition, 

°1  ,  °1  - 

whether  these  differences  are  zero  or  not,  D(u)  -  D(u)  provides  a 

criteria  for  choosing  an  adequate  density  to  model  the  data.  One 

may  also  leave  one  of  the  differences  out  and  see  whether  the  other 

difference  alone  is  an  adequate  model  of  the  differences  of  the 

two  samples.  That  is,  one  may  use  D(u)  -  u  *  ( 1— A >  0  f^Q^u)  or 

D(u)  -  u  ■  (1-A)  *p  Q  (u)  f  Q  (u)  rather  than  both  terms  at  once, 

o  oo 

as  illustrated  with  the  exponential  in  Theorem  2.3.  In  this  case, 
we  merely  drop  some  terras  from  the  distributions  developed  for  D(u) , 
D(u)  -  D(u),  and  4^(u) . 
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